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TRANSPORTERS AND ION CHANNELS 

This invention relates to nucleic acid and amino acid sequences of transporters and ion 
channels and to the use of these sequences in the diagnosis, treatment, and prevention of transport, 
5 neurological, muscle, immunological and cell proliferative disorders, and in the assessment of the 
effects of exogenous compounds on the expression of nucleic acid and amino acid sequences of 
transporters and ion channels. 

BACKGROUND OF THE INVENTION 

10 Eukaryotic cells are surrounded and subdivided into functionally distinct organelles by 

hydrophobic lipid bilayer membranes which are highly impermeable to most polar molecules. Cells 
and organelles require transport proteins to import and export essential nutrients and metal ions 
including K + , NH 4 + , P 4 , S0 4 2 \ sugars, and vitamins, as well as various metabolic waste products. 
Transport proteins also play roles in antibiotic resistance, toxin secretion, ion balance, synaptic 

15 neurotransmission, kidney function, intestinal absorption, tumor growth, and other diverse cell 
functions (Griffith, J. and C. Sansom (1998) The Transporter Facts Book . Academic Press, San 
Diego CA, pp. 3-29). Transport can occur by a passive concentration-dependent mechanism, or can 
be linked to an energy source such as ATP hydrolysis or an ion gradient. Proteins that function in 
transport include carrier proteins, which bind to a specific solute and undergo a conformational 

20 change that translocates the bound solute across the membrane, and channel proteins, which form 
hydrophilic pores that allow specific solutes to diffuse through the membrane down an 
electrochemical solute gradient 

Carrier proteins which transport a single solute from one side of the membrane to the other 
are called uniporters. In contrast, coupled transporters link the transfer of one solute with 

25 simultaneous or sequential transfer of a second solute, either in the same direction (symport) or in 
the opposite direction (antiport). For example, intestinal and kidney epithelium contains a variety 
of symporter systems driven by the sodium gradient that exists across the plasma membrane. 
Sodium moves into the cell down its electrochemical gradient and brings the solute into the cell 
with it. The sodium gradient that provides the driving force for solute uptake is maintained by the 

30 ubiquitous Na"7K + ATPase system. Sodium-coupled transporters include the mammalian glucose 
transporter (SGLT1), iodide transporter (NIS), and multivitamin transporter (SMVT). All three 
transporters have twelve putative transmembrane segments, extracellular glycosylation sites, and 
cytoplasmically-oriented N- and C-termini. NIS plays a crucial role in the evaluation, diagnosis, 
and treatment of various thyroid pathologies because it is the molecular basis for radioiodide 

35 thyroid-imaging techniques and for specific targeting of radioisotopes to the thyroid gland (Levy, 
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0. et al. (1997) Proc, Natl. Acad. Sci. USA 94:5568-5573). SMVT is expressed in the intestinal 
mucosa, kidney, and placenta, and is implicated in the transport of the water-soluble vitamins, e.g., 
biotin and pantothenate (Prasad, P.D. et al. (1998) J. Biol. Chem. 273:7501-7506). 

One of the largest families of transporters is the major facilitator superfamily (MFS), also 

5 called the uniporter-symporter-antiporter family. MFS transporters are single polypeptide carriers 
that transport small solutes in response to ion gradients. Members of the MFS are found in all 
classes of living organisms, and include transporters for sugars, oligosaccharides, phosphates, 
nitrates, nucleosides, monocarboxylates, and drugs. MFS transporters found in eukaryotes all have 
a structure comprising 12 transmembrane segments (Pao, S.S. et al. (1998) Microbiol. Molec. Biol. 

10 Rev. 62: 1-34). The largest family of MFS transporters is the sugar transporter family, which 

includes the seven glucose transporters (GLUT1-GLUT7) found in humans that are required for the 
transport of glucose and other hexose sugars. These glucose transport proteins have unique tissue 
distributions and physiological functions. GLUT1 provides many cell types with their basal glucose 
requirements and transports glucose across epithelial and endothelial barrier tissues; GLUT2 

15 facilitates glucose uptake or efflux from the liver; GLUT3 regulates glucose supply to neurons; 
GLUT4 is responsible for insulin-regulated glucose disposal; and GLUTS regulates fructose uptake 
into skeletal muscle. Defects in glucose transporters are involved in a recently identified 
neurological syndrome causing infantile seizures and developmental delay, as well as glycogen 
storage disease, Fanconi-Bickel syndrome, and non-insulin-dependent diabetes mellitus (Mueckler, 

20 M. (1994) Eur. J. Biochem. 219:713-725; Longo, N. and L.J. Elsas (1998) Adv. Pediatr. 45:293- 
313). 

Monocarboxylate anion transporters are proton-coupled symporters with a broad substrate 
specificity that includes L-lactate, pyruvate, and the ketone bodies acetate, acetoacetate, and 
beta-hydroxybutyrate. At least seven isofonns have been identified to date. The isoforms are 

25 predicted to have twelve transmembrane (TM) helical domains with a large intracellular loop 
between TM6 and TM7, and play a critical role in maintaining intracellular pH by removing the 
protons that are produced stoichiometrically with lactate during glycolysis. The best characterized 
H + -monocarboxylate transporter is that of the erythrocyte membrane, which transports L-lactate and 
a wide range of other aliphatic monocarboxylates. Other cells possess HMinked monocarboxylate 

30 transporters with differing substrate and inhibitor seiectivities. In particular, cardiac muscle and 
tumor cells have transporters that differ in their values for certain substrates, including 
stereoselectivity for L- over D-lactate, and in their sensitivity to inhibitors. There are 
Na + -monocarboxylate cotransporters on the luminal surface of intestinal and kidney epithelia, which 
allow the uptake of lactate, pyruvate, and ketone bodies in these tissues. In addition, there are 

35 specific and selective transporters for organic cations and organic anions in organs including the 
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kidney, intestine and liver. Organic anion transporters are selective for hydrophobic, charged 
molecules with electron-attracting side groups. Organic cation transporters, such as the ammonium 
transporter, mediate the secretion of a variety of drugs and endogenous metabolites, and contribute 
to the maintenance of intercellular pH (Poole, R.C. and A.P. Halestrap (1993) Am J. Physiol. 

5 264:C761-C782; Price, N.T. et al. (1998) Biochem. J. 329:321-328; and Martinelle, K. and 1 
Haggstrom (1993) J. Biotechnol. 30:339-350). 

ATP-binding cassette (ABC) transporters are members of a superfamily of membrane 
proteins that transport substances ranging from small molecules such as ions, sugars, amino acids, 
peptides, and phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC 

10 transporters consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze 
ATP to supply the energy required for transport, and two membrane-spanning domains (MSD), each 
containing six putative transmembrane segments. These four modules may be encoded by a single 
gene, as is the case for the cystic fibrosis transmembrane regulator (CFTR), or by separate genes. 
When encoded by separate genes, each gene product contains a single NBD and MSD. These "half- 

15 molecules" form homo- and heterodimers, such as Tapl and Tap2, the endoplasmic reticulum-based 
major histocompatibility (MHC) peptide transport system. Several genetic diseases are attributed to 
defects in ABC transporters, such as the following diseases and their corresponding proteins: cystic 
Fibrosis (CFTR, an ion channel), adrenoleukodystrophy (adrenoleukodystrophy protein, ALDP), 
Zellweger syndrome (peroxisomal membrane protein-70, PMP70), and hyperinsulinemic 

20 hypoglycemia (sulfonylurea receptor, SUR). Overexpression of the multidrug resistance (MDR) 
protein, another ABC transporter, in human cancer cells makes the cells resistant to a variety of 
cytotoxic drugs used in chemotherapy (Taglicht, D. and S. Michaelis (1998) Meth. Enzymol. 
292:130-162). 

A number of metal ions such as iron, zinc, copper, cobalt, manganese, molybdenum, 
25 selenium, nickel, and chromium are important as cofactors for a number of enzymes. For example, 
copper is involved in hemoglobin synthesis, connective tissue metabolism, and bone development, 
by acting as a cofactor in oxidoreductases such as superoxide dismutase, ferroxidase 
(ceruloplasmin), and lysyl oxidase. Copper and other metal ions must be provided in the diet, and 
are absorbed by transporters in the gastrointestinal tract. Plasma proteins transport the metal ions to 
30 the liver and other target organs, where specific transporters move the ions into cells and cellular 
organelles as needed. Imbalances in metal ion metabolism have been associated with a number of 
disease states (Danks, D.M. (1986) J. Med. Genet. 23:99-106). 

P-type ATPases comprise a class of cation-transporting transmembrane proteins. They are 
integral membrane proteins which use an aspartyl phosphate intermediate to move cations across a 
35 membrane. Features of P-type ATPases include: (i) a cation channel; (ii) a stalk, formed by 
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extensions of the transmembrane a-helices into the cytoplasm; (iii) an ATP binding domain; (iv) a 
phosphorylated aspartic acid; (v) an adjacent transduction domain; (vi) a phosphatase domain, 
which removes the phosphate from the aspartic acid as part of the reaction cycle; and (vii) six or 
more transmembrane domains. Included in this class are heavy metal-transporting ATPases as well 
5 as aminophospholipid transporters. 

The transport of phosphatidylserine and phosphatidylethanolamine by aminophospholipid 
translocase results in the movement of these molecules from one side of a bilayer to another. This 
transport is conducted by a newly identified subfamily of P-type ATPases which are proposed to be 
amphipath transporters. Amphipath transporters move molecules having both a hydrophilic and a 

10 hydrophobic region. As many as seventeen different genes belong to this P-type ATPases 

subfamily, being grouped into several distinct classes and subclasses (Halleck, M.S. et al., (1999) 
Physiol, Genomics 1:139-150; Vulpe.C. Et aL, (1993) Nat. Genet 3:7-13). 

Transport of fatty acids across the plasma membrane can occur by diffusion, a high 
capacity, low affinity process. However, under normal physiological conditions a significant 

15 fraction of fatty acid transport appears to occur via a high affinity, low capacity protein-mediated 
. transport process. Fatty acid transport protein (FATP), an integral membrane protein with four 
transmembrane segments, is expressed in tissues exhibiting high levels of plasma membrane fatty 
acid flux, such as muscle, heart, and adipose. Expression of FATP is upregulated in 3T3-L1 cells 
during adipose conversion, and expression in COS7 fibroblasts elevates uptake of long-chain fatty 

20 acids (Hui, T.Y. et al. (1998) J. Biol. Chem. 273:27420-27429). 

The lipocalin superfamily constitutes a phylogenetically conserved group of more than forty 
proteins that function as extracellular ligand-binding proteins which bind and transport small 
hydrophobic molecules. Members of this family function as carriers of retinoids, odorants, 
chromophores, pheromones, allergens, and sterols, and in a variety of processes including nutrient 

25 transport, cell growth regulation, immune response, and prostaglandin synthesis. A subset of these 
proteins may be multifunctional, serving as either a biosynthetic enzyme or as a specific enzyme 
inhibitor. (Tanaka, T. et al. (1997) J. Biol. Chem. 272:15789-15795; and van't Hof, W. et al. (1997) 
J. Biol. Chem. 272:1837-1841.) 

Members of the lipocalin family display unusually low levels of overall sequence 

30 conservation. Pairwise sequence identity often falls below 20%. Sequence similarity between 
family members is limited to conserved cysteines which form disulfide bonds and three motifs 
which form a juxtaposed cluster that functions as a target cell recognition site. The lipocalins share 
an eight stranded, anti-parallel beta-sheet which folds back on itself to form a continuously 
hydrogen-bonded beta-barrel. The pocket formed by the barrel functions as an internal ligand 

35 binding site. Seven loops (LI to L7) form short beta-hairpins, except loop LI which is a large 
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omega loop that forms a lid to partially close the internal Hgand-binding site (Flower (1996) 
Biochem. 1318:1-14). 

Lipocalins are important transport molecules. Each lipocalin associates with a particular 
Hgand and delivers that iigand to appropriate target sites within the organism. Retinol-binding 

5 protein (RBP), one of the best characterized lipocalins, transports retinol from stores within the liver 
to target tissues. Apolipoprotein D (apo D), a component of high density lipoproteins (HDLs) and 
low density lipoproteins (LDLs), functions in the targeted collection and delivery of cholesterol 
throughout the body. Lipocalins are also involved in cell regulatory processes. Apo D, which is 
identical to gross-cystic-disease-fluid protein (GCDFP)-24, is a progesterone/pregnenolone-binding 

10 protein expressed at high levels in breast cyst fluid. Secretion of apo D in certain human breast 
cancer cell lines is accompanied by reduced cell proliferation and progression of cells to a more 
differentiated phenotype. Similarly, apo D and another lipocalin, a r acid glycoprotein (AGP), are 
involved in nerve cell regeneration. AGP is also involved in anti-inflammatory and 
immunosuppressive activities. AGP is one of the positive acute-phase proteins (APP); circulating 

15 levels of AGP increase in response to stress and inflammatory stimulation. AGP accumulates at 
sites of inflammation where it inhibits platelet and neutrophil activation and inhibits phagocytosis. 
The immunomodulatory properties of AGP are due to glycosylation. AGP is 40% carbohydrate, 
making it unusually acidic and soluble. The glycosylation pattern of AGP changes during acute- 
phase response, and deglycosylated AGP has no immunosuppressive activity (Flower (1994) FEBS 

20 Lett 354:7-1 1 ; Flower (1996) supra) . 

The lipocalin superfamily also includes several animal allergens, including the mouse major 
urinary protein (mMUP), the rat a-2-microgloobulin (rA2XJ), the bovine p-lactoglobulin (pig), the 
cockroach allergen (Bla g4), bovine dander allergen (Bos d2), and the major horse allergen, 
designated Equus caballus allergen 1 (Equ cl). Equ cl is a powerful allergen responsible for about 

25 80% of anti-horse IgE antibody response in patients who are chronically exposed to horse allergens. 
It appears that lipocalins may contain a common structure that is able to induce the IgE response 
(Gregoire, C. et al, (1996) J. Biol Chem. 271:32951-32959). 

Lipocalins are used as diagnostic and prognostic markers in a variety of disease states. The 
plasma level of AGP is monitored during pregnancy and in diagnosis and prognosis of conditions 

30 including cancer chemotherapy, renal disfunction, myocardial infarction, arthritis, and multiple 
sclerosis. RBP is used clinically as a marker of tubular reabsorption in the kidney, and apo D is a 
marker in gross cystic breast disease (Flower (1996) supra) . Additionally, the use of lipocalin 
animal allergens may help in the diagnosis of allergic reactions to horses (Gregoire supra ), pigs, 
cockroaches, mice and rats. 

35 Mitochondrial carrier proteins are transmembrane-spanning proteins which transport ions 
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and charged metabolites between the cytosol and the mitochondrial matrix. Examples include the 
ADP, ATP carrier protein; the 2-oxoglutarate/raalate carrier; the phosphate carrier protein; the 
pyruvate carrier; the dicarboxylate carrier which transports malate, succinate, fumarate, and 
phosphate; the tricarboxylate carrier which transports citrate and malate; and the Grave's disease 

5 carrier protein, a protein recognized by IgG in patients with active Grave's disease, an autoimmune 
disorder resulting in hyperthyroidism. Proteins in this family consist of three tandem repeats of an 
approximately 100 amino acid domain, each of which contains two transmembrane regions (Stryer, 
L. (1995) Biochemistry , W.H. Freeman and Company, New York NY, p. 551; PROSITE 
PDOC00189 Mitochondrial energy transfer proteins signature; Online Mendelian Inheritance in 

10 Man (OMM) *275000 Graves Disease). 

This class of transporters also includes the mitochondrial uncoupling proteins, which create 
proton leaks across the inner mitochondrial membrane, thus uncoupling oxidative phosphorylation 
from ATP synthesis. The result is energy dissipation in the form of heat. Mitochondrial uncoupling 
proteins have been implicated as modulators of thermoregulation and metabolic rate, and have been 

15 proposed as potential targets for drugs against metabolic diseases such as obesity (Ricquier, D. et al. 
(1999) J. Int. Med. 245:637-642). 
Ion Channels 

The electrical potential of a cell is generated and maintained by controlling the movement 
of ions across the plasma membrane. The movement of ions requires ion channels, which form ion- 

20 selective pores within the membrane. There are two basic types of ion channels, ion transporters 
and gated ion channels. Ion transporters utilize the energy obtained from ATP hydrolysis to actively 
transport an ion against the ion's concentration gradient Gated ion channels allow passive flow of 
an ion down the ion's electrochemical gradient under restricted conditions. Together, these types of 
ion channels generate, maintain, and utilize an electrochemical gradient that is used in 1) electrical 

25 impulse conduction down the axon of a nerve cell, 2) transport of molecules into cells against 
concentration gradients, 3) initiation of muscle contraction, and 4) endocrine cell secretion. 
Ion Transporters 

Ion transporters generate and maintain the resting electrical potential of a cell. Utilizing the 
energy derived from ATP hydrolysis, they transport ions against the ion's concentration gradient. 

30 These transmembrane ATPases are divided into three families. The phosphorylated (?) class ion 
transporters, including Na + -K + ATPase, Ca 2 *-ATPase, and H + -ATPase, are activated by a 
phosphorylation event. P~class ion transporters, also known as E1-E2 type ATPases, are responsible 
for maintaining resting potential distributions such that cytosolic concentrations of Na* and Ca 2+ are 
low and cytosolic concentration of K + is high. The vacuolar (V) class of ion transporters includes 

35 H + pumps on intracellular organelles, such as lysosomes and Golgi. V-class ion transporters are 
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responsible for generating the low pH within the lumen of these organelles that is required for 
function. The coupling factor (F) class consists of H + pumps in the mitochondria. F-class ion 
transporters utilize a proton gradient to generate ATP from ADP and inorganic phosphate (P 5 ). 
The P-ATPases are hexamers of a 100 kD subunit with ten transmembrane domains and 

5 several large cytoplasmic regions that may play a role in ion binding (Scarborough, G.A. (1999) 
Curr. Opin. Cell Biol 11:517-522). The V-ATPases are composed of two functional domains: the 
V, domain, a peripheral complex responsible for ATP hydrolysis; and the V 0 domain, an integral 
complex responsible for proton translocation across the membrane. The F-ATPases are structurally 
and evolutionarily related to the V-ATPases. The F-ATPase F 0 domain contains 12 copies of the c 

10 subunit, a highly hydrophobic protein composed of two transmembrane domains and containing a 
single buried carboxyl group in TM2 that is essential for proton transport The V-ATPase V 0 
domain contains three types of homologous c subunits with four or five transmembrane domains 
and the essential carboxyl group in TM4 or TM3. Both types of complex also contain a single a 
subunit that may be involved in regulating the pH dependence of activity (Forgac, M. (1999) J. 

15 Biol. Chern. 274:12951-12954). 

The resting potential of the cell is utilized in many processes involving carrier proteins and 
gated ion channels. Carrier proteins utilize the resting potential to transport molecules into and out 
of the ceil. Amino acid and glucose transport into many cells is linked to sodium ion co-transport 
(symport) so that the movement of Na* down an electrochemical gradient drives transport of the 

20 other molecule up a concentration gradient. Similarly, cardiac muscle links transfer of Ca 2 * out of 
the cell with transport of Na* into the cell (antiport). 
Gated Ion Channels 

Gated ion channels control ion flow by regulating the opening and closing of pores. The 
ability to control ion flux through various gating mechanisms allows ion channels to mediate such 

25 diverse signaling and homeostatic functions as neuronal and endocrine signaling, muscle 

contraction, fertilization, and regulation of ion and pH balance. Gated ion channels are categorized 
according to the manner of regulating the gating function. Mechanically-gated channels open their 
pores in response to mechanical stress; voltage-gated channels (e.g., Na + , K*, Ca*\ and Ci" channels) 
open their pores in response to changes in membrane potential; and ligand-gated channels (e.g., 

30 acetylcholine-, serotonin-, and glutamate-gated cation channels, and GABA- and glycine-gated 
chloride channels) open their pores in the presence of a specific ion, nucleotide, or neurotransmitter. 
The gating properties of a particular ion channel (i.e., its threshold for and duration of opening and 
closing) are sometimes modulated by association with auxiliary channel proteins and/or post 
translational modifications, such as phosphorylation. 

35 Mechanically-gated or mechanosensitive ion channels act as transducers for the senses of 
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touch, hearing, and balance, and also play important roles in cell volume regulation, smooth muscle 
contraction, and cardiac rhythm generation. A stretch-inactivated channel (SIC) was recently 
cloned from rat kidney. The SIC channel belongs to a group of channels which are activated by 
pressure or stress on the cell membrane and conduct both Ca 2+ and Na + (Suzuki, M. et al. (1999) J. 

5 Biol. Chem. 274:6330-6335). 

The pore-forming subunits of the voltage-gated cation channels form a superfamily of ion 
channel proteins. The characteristic domain of these channel proteins comprises six transmembrane 
domains (S1-S6), a pore-forming region (P) located between S5 and S6, and intracellular amino and 
carboxy termini. In the Na + and Ca 2+ subfamilies, this domain is repeated four times, while in the 

10 K* channel subfamily, each channel is formed from a tetramer of either identical or dissimilar 
subunits. The P region contains information specifying the ion selectivity for the channel. In the 
case of K + channels, a GYG tripeptide is involved in this selectivity (Ishii, T.M. et al. (1997) Proc. 
Natl. Acad. Sci. USA 94:11651-11656). 

Voltage-gated Na + and K* channels are necessary for the function of electrically excitable 

15 cells, such as nerve and muscle cells. Action potentials, which lead to neurotransmitter release and 
muscle contraction, arise from large, transient changes in the permeability of the membrane to Na + 
and K + ions. Depolarization of the membrane beyond the threshold level opens voltage-gated Na + 
channels. Sodium ions flow into the cell, further depolarizing the membrane and opening more 
voltage-gated Na + channels, which propagates the depolarization down the length of the cell. 

20 Depolarization also opens voltage-gated potassium channels. Consequently, potassium ions flow 
outward, which leads to repolarization of the membrane. Voltage-gated channels utilize charged 
residues in the fourth transmembrane segment (S4) to sense voltage change. The open state lasts 
only about 1 millisecond, at which time the channel spontaneously converts into an inactive state 
that cannot be opened irrespective of the membrane potential. Inactivation is mediated by the 

25 channel's N-terminus, which acts as a plug that closes the pore. The transition from an inactive to a 
closed state requires a return to resting potential. 

Voltage-gated Na + channels are heterotrimeric complexes composed of a 260 kDa pore- 
forming a subunit that associates with two smaller auxiliary subunits, pi and p2. The J32 subunit is 
a integral membrane glycoprotein that contains an extracellular Ig domain, and its association with a 

30 and pi subunits correlates with increased functional expression of the channel, a change in its 
gating properties, as well as an increase in whole cell capacitance due to an increase in membrane 
surface area (Isom, L.L. et al. (1995) Cell 83:433^42). 

Non voltage-gated Na + channels include the members of the amiloride-sensitive Na + 
channel/degenerin (NaC/DEG) family. Channel subunits of this family are thought to consist of two 

35 transmembrane domains flanking a long extracellular loop, with the amino and carboxyl termini 



8 



WO 02/46415 



PCT/US01/4696J 



located within the cell. The NaC/DEG family includes the epithelial Na + channel (ENaC) involved 
in Na + reabsorption in epithelia including the airway, distal colon, cortical collecting duct of the 
kidney, and exocrine duct glands. Mutations in ENaC result in pseudohypoaldosteronism type 1 
and Liddle's syndrome (pseudohyperaldosteronism). The NaC/DEG family also includes the 
5 recently characterized HP-gated cation channels or acid-sensing ion channels (ASIC). ASIC 
subunits are expressed in the brain and form heteromultimeric Na*-permeable channels. These 
channels require acid pH fluctuations for activation. ASIC subunits show homology to the 
degenerins, a family of mechanically-gated channels originally isolated from C. elegans. Mutations 
in the degenerins cause neurodegeneration. ASIC subunits may also have a role in neuronal 
10 function, or in pain perception, since tissue acidosis causes pain (Waldmann, R. and M. Lazdunski 
(1998) Curr. Opin. Neurobiol. 8:418-424; Eglen, R.M. et al. (1999) Trends Pharmacol. Sci. 20:337- 
342). 

K + channels are located in all cell types, and may be regulated by voltage, ATP 
concentration, or second messengers such as Ca 2+ and cAMP. In non-excitable tissue, K + channels 

15 are involved in protein synthesis, control of endocrine secretions, and the maintenance of osmotic 
equilibrium across membranes. In neurons and other excitable cells, in addition to regulating action 
potentials and repolarizing membranes, K + channels are responsible for setting the resting 
membrane potential. The cytosol contains non-diffusible anions and, to balance this net negative 
charge, the cell contains a Na + -K* pump and ion channels that provide the redistribution of Na + , K\ 

20 and Cl\ The pump actively transports Na + out of the cell and K + into the cell in a 3:2 ratio. Ion 
channels in the plasma membrane allow K* and CI * to flow by passive diffusion. Because of the 
high negative charge within the cytosol, CI' flows out of the cell. The flow of K + is balanced by an 
electromotive force pulling K* into the cell, and a K + concentration gradient pushing K + out of the 
cell. Thus, the resting membrane potential is primarily regulated by K + flow (Salkoff, L. and T. 

25 Jegla (1995) Neuron 15:489-492). 

Potassium channel subunits of the Shaker- like superfamily all have the characteristic six 
transmembrane/1 pore domain structure. Four subunits combine as homo- or heterotetramers to 
form functional K channels. These pore-forming subunits also associate with various cytoplasmic f} 
subunits that alter channel inactivation kinetics. The Shaker -like channel family includes the 

30 voltage-gated K* channels as well as the delayed rectifier type channels such as the human ether-a- 
go-go related gene (HERG) associated with long QT, a cardiac dysry thmia syndrome (Curran, M.E. 
(1998) Curr. Opin. Biotechnol. 9:565-572; Kaczorowski, GJ. and M.L. Garcia (1999) Curr. Opin. 
Chem. Biol. 3:448-458). 

A second superfamily of K + channels is composed of the inward rectifying channels (Kir). 

35 Kir channels have the property of preferentially conducting K + currents in the inward direction. 
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These proteins consist of a single potassium selective pore domain and two transmembrane 
domains, which correspond to the fifth and sixth transmembrane domains of voltage-gated K + 
channels. Kir subunits also associate as tetramers. The Kir family includes ROMK1, mutations in 
which lead to Bartter syndrome, a renal tubular disorder. Kir channels are also involved in 

5 regulation of cardiac pacemaker activity, seizures and epilepsy, and insulin regulation (Doupnik, 
CA. et al. (1995) Curr. Opin. Neurobiol. 5:268-277; Curran, supra ). 

The recently recognized TWEK K* channel family includes the mammalian TWIK-1, 
TREK-1 and TASK proteins. Members of this family possess an overall structure with four 
transmembrane domains and two P domains. These proteins are probably involved in controlling 

10 the resting potential in a large set of cell types (Duprat, F. et al. (1997) EMBO J 16:5464-5471). 

The voltage-gated Ca 2 * channels have been classified into several subtypes based upon their 
electrophysiological and pharmacological characteristics. L-type Ca 2+ channels are predominantly 
expressed in heart and skeletal muscle where they play an essential role in excitation-contraction 
coupling. T-type channels are important for cardiac pacemaker activity, while N-type and P/Q-type 

15 channels are involved in the control of neurotransmitter release in the central and peripheral nervous 
system. The L-type and N-type voltage-gated Ca 2 * channels have been purified and, though their 
functions differ dramatically, they have similar subunit compositions. The channels are composed 
of three subunits. The a, subunit forms the membrane pore and voltage sensor, while the 0^5 and p 
subunits modulate the voltage-dependence, gating properties, and the current amplitude of the 

20 channel. These subunits are encoded by at least six a,, one OjS, and four p genes. A fourth subunit, 
y, has been identified in skeletal muscle (Walker, D. et al. (1998) J. Biol. Chem. 273:2361-2367; 
McCleskey, E.W. (1994) Curr. Opin. Neurobiol. 4:304-312). 

The high-voltage-activated Ca 2+ channels that have been characterized biochemically 
include complexes of a pore-forming alphal subunit of approximately 190-250 kDa; a 

25 transmembrane complex of alpha2 and delta subunits; an intracellular beta subunit; and in some 
cases a transmembrane gamma subunit. A variety of alphal subunits, alpha2delta complexes, beta 
subunits, and gamma subunits are known. The Cavl family of alphal subunits conduct L-type Ca 2+ 
currents, which initiate muscle contraction, endocrine secretion, and gene transcription, and are 
regulated primarily by second messenger-activated protein phosphorylation pathways. The Cav2 

30 family of alphal subunits conduct N-type, P/Q-type, and R-type Ca 2+ currents, which initiate rapid 
synaptic transmission and are regulated primarily by direct interaction with G proteins and SNARE 
proteins and secondarily by protein phosphorylation. The Cav3 family of alphal subunits conduct 
T-type Ca 2+ currents, which are activated and inactivated more rapidly and at more negative 
membrane potentials than other Ca 2+ current types. The distinct structures and patterns of 

35 regulation of these three families of Ca 2 * channels provide an array of Ca 2 * entry pathways in 
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response to changes in membrane potential and a range of possibilities for regulation of Ca 2+ entry 
by second messenger pathways and interacting proteins (Catterall, W. A. (2000) Annu. Rev. Cell 
Dev. Biol. 16:521-555). 

The alpha-2 subunit of the voltage-gated Ca 2+ -channel may include one or more Cache 

5 domains. An extracellular Cache domain may be fused to an intracellular catalytic domain, such as 
the histidine kinase, PP2C phosphatase, GGDEF (a predicted diguanylate cyclase), HD-GYP (a 
predicted phosphodiesterase) or adenylyl cyclase domain, or to a noncatalytic domain, like the 
methyl-accepting, DNA-binding winged helix-tum-helix, GAF, PAS or HAMP (a domain found in 
istidine kinases, denylyl cyclases, ethyl-binding proteins and phosphatases). Small molecules are 

10 bound via the Cache domain and this signal is converted into diverse outputs depending on the 
intracellular domains (Anantharaman, V. and Aravind, L.(2000) Trends Biochem. Sci. 25:535-537). 

The transient receptor family (Trp) of calcium ion channels are thought to mediate 
capacitative calcium entry (CCE). CCE is the Ca 2+ influx into cells to resupply Ca 2 * stores depleted 
by the action of inositol triphosphate (IP3) and other agents in response to numerous hormones and 

15 growth factors. Trp and Trp-like were first cloned from Drosophila and have similarity to voltage 
gated Ca 2+ channels in the S3 through S6 regions. This suggests that Trp and/or related proteins 
may form mammalian CCE channels (Zhu, X. et al. (1996) Cell 85:661-671; Boulay, G. et al. (1997) 
J. Biol. Chera 272:29672-29680). Melastatin is a gene isolated in both the mouse and human, 
whose expression in melanoma cells is inversely correlated with melanoma aggressiveness in vivo . 

20 The human cDNA transcript corresponds to a 1533-arnino acid protein having homology to 
members of the Trp family. It has been proposed that the combined use of malastatin mRNA 
expression status and tumor thickness might allow for the determination of subgroups of patients at 
both low and high risk for developing metastatic disease (Duncan, L.M. et al (2001) J. Clin. Oncol. 
19:568-576). 

25 Chloride channels are necessary in endocrine secretion and in regulation of cytosolic and 

organelle pH. In secretory epithelial cells, CI" enters the cell across a basolateral membrane through 
an Na + , K7C1' cotransporter, accumulating in the cell above its electrochemical equilibrium 
concentration. Secretion of CI " from the apical surface, in response to hormonal stimulation, leads 
to flow of Na + and water into the secretory lumen. The cystic fibrosis transmembrane conductance 

30 regulator (CFTR) is a chloride channel encoded by the gene for cystic fibrosis, a common fatal 
genetic disorder in humans. CFTR is a member of the ABC transporter family, and is composed of 
two domains each consisting of six transmembrane domains followed by a nucleotide-binding site. 
Loss of CFTR function decreases transepithelial water secretion and, as a result, the layers of mucus 
that coat the respiratory tree, pancreatic ducts, and intestine are dehydrated and difficult to clear. 

35 The resulting blockage of these sites leads to pancreatic insufficiency, "meconium ileus", and 
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devastating "chronic obstructive pulmonary disease" (AI-Awqati, Q. et al. (1992) J. Exp. Biol. 
172:245-266). 

The voltage-gated chloride channels (CLC) are characterized by 10-12 transmembrane 
domains, as well as two small globular domains known as CBS domains. The CLC subunits 
5 probably function as homotetramers. CLC proteins are involved in regulation of cell volume, 
membrane potential stabilization, signal transduction, and transepithelial transport. Mutations in 
CLC-1, expressed predominantly in skeletal muscle, are responsible for autosomal recessive 
generalized myotonia and autosomal dominant myotonia congenita, while mutations in the kidney 
channel CLC-5 lead to kidney stones (Jentsch, TJ. (1996) Curr. Opin. Neurobiol. 6:303-310). 

10 Ligand-gated channels open their pores when an extracellular or intracellular mediator 

binds to the channel. Neurotransmitter-gated channels are- channels that open when a 
neurotransmitter binds to their extracellular domain. These channels exist in the postsynaptic 
membrane of nerve or muscle cells. There are two types of neurotransmitter-gated channels. 
Sodium channels open in response to excitatory neurotransmitters, such as acetylcholine, glutamate, 

15 and serotonin. This opening causes an influx of Na + and produces the initial localized 

depolarization that activates the voltage-gated channels and starts the action potential. Chloride 
channels open in response to inhibitory neurotransmitters, such as 7-aminobutyric acid (GAB A) and 
glycine, leading to hyperpolarization of the membrane and the subsequent generation of an action 
potential. Neurotransmitter-gated ion channels have four transmembrane domains and probably 

20 function as pentamers (Jentsch, supra) . Amino acids in the second transmembrane domain appear to 
be important in determining channel permeation and selectivity (Sather, W.A. et al. (1994) Curr. 
Opin. Neurobiol. 4:313-323). 

Ligand-gated channels can be regulated by intracellular second messengers. For example, 
calcium-activated K + channels are gated by internal calcium ions. In nerve cells, an influx of 

25 calcium during depolarization opens K* channels to modulate the magnitude of the action potential 
(Ishi et al., supra) . The large conductance (BK) channel has been purified from brain and its subunit 
composition determined. The a subunit of the BK channel has seven rather than six transmembrane 
domains in contrast to voltage-gated K* channels. The extra transmembrane domain is located at 
the subunit N-terminus. A 28-amino-acid stretch in the C-terminal region of the subunit (the 

30 "calcium bowl" region) contains many negatively charged residues and is thought to be the region 
responsible for calcium binding. The P subunit consists of two transmembrane domains connected 
by a glycosylated extracellular loop, with intracellular N- and C-termini (Kaczorowski, supra : 
Vergara, C. et al. (1998) Curr. Opin. Neurobiol. 8:321-329). 

Cyclic nucleotide-gated (CNG) channels are gated by cytosolic cyclic nucleotides. The best 

35 examples of these are the cAMP-gated Na* channels involved in olfaction and the cGMP-gated 
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cation channels involved in vision. Both systems involve ligand-mediated activation of a G-protein 
coupled receptor which then alters the level of cyclic nucleotide within the cell. CNG channels also 
represent a major pathway for Ca 2 * entry into neurons, and play roles in neuronal development and 
plasticity. CNG channels are tetramers containing at least two types of subunits, an a subunit which 

5 can form functional homomeric channels, and a P subunit, which modulates the channel properties. 
All CNG subunits have six transmembrane domains and a pore forming region between the fifth and 
sixth transmembrane domains, similar to voltage-gated K + channels. A large C-terminal domain 
contains a cyclic nucleotide binding domain, while the N-terminal domain confers variation among 
channel subtypes (Zufall, F. et al. (1997) Curr. Opin. Neurobiol. 7:404-412). 

10 The activity of other types of ion channel proteins may also be modulated by a variety of 

intracellular signalling proteins. Many channels have sites for phosphorylation by one or more 
protein kinases including protein kinase A, protein kinase C, tyrosine kinase, and casein kinase n, 
all of which regulate ion channel activity in cells. Kir channels are activated by the binding of the 
G(fy subunits of heterotrimeric G-proteins (Reimann, R and F.M. Ashcroft (1999) Curr. Opin. Cell. 

15 Biol. 11:503-508). Other proteins are involved in the localization of ion channels to specific sites in 
the cell membrane. Such proteins include the PDZ domain proteins known as MAGUKs 
(membrane-associated guanylate kinases) which regulate the clustering of ion channels at neuronal 
synapses (Craven, S.E. and D.S. Bredt (1998) Cell 93:495-498). 
Disease Correlation 

20 The etiology of numerous human diseases and disorders can be attributed to defects in the 

transport of molecules across membranes. Defects in the trafficking of membrane-bound 
transporters and ion channels are associated with several disorders, e.g., cystic fibrosis, 
glucose-galactose malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain 
forms of diabetes mellitus. Single-gene defect diseases resulting in an inability to transport small 

25 molecules across membranes include, e.g., cystinuria, iminoglycinuria, Hartup disease, and Fanconi 
disease (van* Hoff, W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G.M. et al (1994) Ann. Intern. 
Med. 120:218-226; and Chillon, M. et al. (1995) New Engl. J. Med. 332:1475-1480). 

Human diseases caused by mutations in ion channel genes include disorders of skeletal 
muscle, cardiac muscle, and the central nervous system. Mutations in the pore-forming subunits of 

30 sodium and chloride channels cause myotonia, a muscle disorder in which relaxation after voluntary 
contraction is delayed. Sodium channel myotonias have been treated with channel blockers. 
Mutations in muscle sodium and calcium channels cause forms of periodic paralysis, while 
mutations in the sarcoplasmic calcium release channel, T-tubule calcium channel, and muscle 
sodium channel cause malignant hyperthermia. Cardiac arrythmia disorders such as the long QT 

35 syndromes and idiopathic ventricular fibrillation are caused by mutations in potassium and sodium 
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channels (Cooper, E.C. and L.Y. Jan (1993) Proc. Natl. Acad. Sci. USA 96:4759-4766). Ail four 
known human idiopathic epilepsy genes code for ion channel proteins (Berkovic, S.F. and LE. 
Scheffer (1999) Curr. Opin. Neurology 12:177-182). Other neurological disorders such as ataxias, 
heraiplegic migraine and hereditary deafness can also result from mutations in ion channel genes 

5 (Jen, J. (1999) Curr. Opin. Neurobiol. 9:274-280; Cooper, supra) . 

Ion channels have been the target for many drug therapies. Neurotransmitter-gated channels 
have been targeted in therapies for treatment of insomnia, anxiety, depression, and schizophrenia. 
Voltage-gated channels have been targeted in therapies for arrhythmia, ischemic stroke, head 
trauma, and neurodegenerative disease (Taylor, CP. and L.S. Narasimhan (1997) Adv. Pharmacol. 

10 39:47-98). Various classes of ion channels also play an important role in the perception of pain, and 
thus are potential targets for new analgesics. These include the vanilloid-gated ion channels, which 
are activated by the vanilloid capsaicin, as well as by noxious heat. Local anesthetics such as 
lidocaine and mexiletine which blockade voltage-gated Na+ channels have been useful in the 
treatment of neuropathic pain (Eglen, supra) . Calcium-channel protein expression is altered in 

15 metastatic melanomas (Enklaar, T. et al. (2000) Genomics 67: 179-187. 

Ion channels in the immune system have recently been suggested as targets for 
immunomodulation. T-cell activation depends upon calcium signaling, and a diverse set of T-cell 
specific ion channels has been characterized that affect this signaling process. Channel blocking 
agents can inhibit secretion of lymphokines, cell proliferation, and killing of target cells. A peptide 

20 antagonist of the T-cell potassium channel Kvl.3 was found to suppress delay ed-type 

hypersensitivity and allogenic responses in pigs, validating the idea of channel blockers as safe and 
efficacious immunosuppressants (Cahalan, M.D. and K.G. Cbandy (1997) Curr. Opin. Biotechnol. 
8:749-756). 

The discovery of new transporters and ion channels, and the polynucleotides encoding 
25 them, satisfies a need in the art by providing new compositions which are useful in the diagnosis, 
prevention, and treatment of transport, neurological, muscle, immunological and cell proliferative 
disorders, and in the assessment of the effects of exogenous compounds on the expression of nucleic 
acid and amino acid sequences of transporters and ion channels. 

30 SUMMARY OF THE INVENTION 

The invention features purified polypeptides, transporters and ion channels, referred to 
collectively as "TRICH" and individually as "TRICH-1," 'TRICH-2," 'TRICH-3," 'TRICH-4," 
"TRICH-5," 'TRICH-6," t TRICH-7/' "TRICH-8," 'TRICH-9," "TRICH-10," "TRICH-ll," 
"TRICH-12," "TRICH-13," 4 TRICH-14," "TRICH-15," 'TRICH-16," "TRICH-17," 'TRICH-18," 
35 "TRICH-19," 'TRICH-20," 'TRICH-21," *TRICH-22," "TRICH-23," 'TRICH-24," 'TRICH-25," 
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•TRICH-26," "TRICH-27," 'TRICH-28," "TRICH-29," •TRICHOO," "TRJCH-31," and "TRICH- 
32." In one aspect, the invention provides an isolated polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid 

5 sequence at least 90% identical to an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. 
In one alternative, the invention provides an isolated polypeptide comprising the amino acid 

10 sequence of SEQ ID NO: 1-32. 

The invention further provides an isolated polynucleotide encoding a polypeptide selected 
from the group consisting of a) a polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid 
sequence at least 90% identical to an amino acid sequence selected from the group consisting of 

15 SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. 
In one alternative, the polynucleotide encodes a polypeptide selected from the group consisting of 
SEQ ED NO: 1-32. In another alternative, the polynucleotide is selected from the group consisting of 

20 SEQIDNO:33-64. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid 

25 sequence at least 90% identical to an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. 
In one alternative, the invention provides a cell transformed with the recombinant polynucleotide. 

30 In another alternative, the invention provides a transgenic organism comprising the recombinant 
polynucleotide. 

The invention also provides a method for producing a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid 
35 sequence at least 90% identical to an amino acid sequence selected from the group consisting of 
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SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. 
The method comprises a) culturing a cell under conditions suitable for expression of the 
5 polypeptide, wherein said cell is transformed with a recombinant polynucleotide comprising a 
promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b) recovering 
the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
10 sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
15 consisting of SEQ ID NO: 1-32. 

The invention further provides an isolated polynucleotide selected from the group 
consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:33-64, b) a polynucleotide comprising a naturally occurring 
polynucleotide sequence at least 90% identical to a polynucleotide sequence selected from the 
20 group consisting of SEQ ID NO:33-64, c) a polynucleotide complementary to the polynucleotide of 
a), d) a polynucleotide complementary to the polynucleotide of b), and e) an RNA equivalent of a)- 
d). In one alternative, the polynucleotide comprises at least 60 contiguous nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a 
sample, said target polynucleotide having a sequence of a polynucleotide selected from the group 
25 consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:33-64, b) a polynucleotide comprising a naturally occurring 
polynucleotide sequence at least 90% identical to a polynucleotide sequence selected from the 
group consisting of SEQ ID NO:33-64, c) a polynucleotide complementary to the polynucleotide of 
a), d) a polynucleotide complementary to the polynucleotide of b), and e) an RNA equivalent of a)- 
30 d). The method comprises a) hybridizing the sample with a probe comprising at least 20 contiguous 
nucleotides comprising a sequence complementary to said target polynucleotide in the sample, and 
which probe specifically hybridizes to said target polynucleotide, under conditions whereby a 
hybridization complex is formed between said probe and said target polynucleotide or fragments 
thereof, and b) detecting the presence or absence of said hybridization complex, and optionally, if 
35 present, the amount thereof. In one alternative, the probe comprises at least 60 contiguous 
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nucleotides. 

The invention further provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide selected from the group consisting 
of a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of 

5 SEQ ID NO:33-<54, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at 
least 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:33-64, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide 
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method 
comprises a) amplifying said target polynucleotide or fragment thereof using polymerase chain 

10 reaction amplification, and b) detecting the presence or absence of said amplified target 
polynucleotide or fragment thereof, and, optionally, if present, the amount thereof. 

The invention further provides a composition comprising an effective amount of a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a 

15 naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ED NO: 1-32, c) a biologically active fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, and a pharmaceutically acceptable excipient. In one embodiment, 

20 the composition comprises an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32. The invention additionally provides a method of treating a disease or condition associated 
with decreased expression of functional TRICH, comprising administering to a patient in need of 
such treatment the composition. 

The invention also provides a method for screening a compound for effectiveness as an 

25 agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ED NO: 1-32, b) a polypeptide comprising 
a naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an 

30 immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32. The method comprises a) exposing a sample comprising the 
polypeptide to a compound, and b) detecting agonist activity in the sample. In one alternative, the 
invention provides a composition comprising an agonist compound identified by the method and a 
pharmaceutically acceptable excipient. In another alternative, the invention provides a method of 

35 treating a disease or condition associated with decreased expression of functional TRICH, 
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comprising administering to a patient in need of such treatment the composition. 

Additionally, the invention provides a method for screening a compound for effectiveness 
as an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide 

5 comprising a naturally occurring amino acid sequence at least 90% identical to an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment 
of a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32, and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32. The method comprises a) exposing a sample 

10 comprising the polypeptide to a compound, and b) detecting antagonist activity in the sample. In 
one alternative, the invention provides a composition comprising an antagonist compound identified 
by the method and a pharmaceutically acceptable excipient. In another alternative, the invention 
provides a method of treating a disease or condition associated with overexpression of functional 
TRICH, comprising administering to a patient in need of such treatment the composition. 

15 The invention further provides a method of screening for a compound that specifically 

binds to a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising 
a naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide 

20 having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32. The method comprises a) combining the polypeptide with at least 
one test compound under suitable conditions, and b) detecting binding of the polypeptide to the test 
compound, thereby identifying a compound that specifically binds to the polypeptide. 

25 The invention further provides a method of screening for a compound that modulates the 

activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising 
a naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide 

30 having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32. The method comprises a) combining the polypeptide with at least 
one test compound under conditions permissive for the activity of the polypeptide, b) assessing the 
activity of the polypeptide in the presence of the test compound, and c) comparing the activity of 

35 the polypeptide in the presence of the test compound with the activity of the polypeptide in the 
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absence of the test compound, wherein a change in the activity of the polypeptide in the presence of 
the test compound is indicative of a compound that modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
5 polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, the method 
comprising a) exposing a sample comprising the target polynucleotide to a compound, b) detecting 
altered expression of the target polynucleotide, and c) comparing the expression of the target 
polynucleotide in the presence of varying amounts of the compound and in the absence of the 
compound. 

10 The invention further provides a method for assessing toxicity of a test compound, said 

method comprising a) treating a biological sample containing nucleic acids with the test compound; 
b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide 
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, ii) 

15 a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical to 
a polynucleotide sequence selected from the group consisting of SEQ ID NO: 33-64, Hi) a 
polynucleotide having a sequence complementary to i), iv) a polynucleotide complementary to the 
polynucleotide of ii), and v) an RNA equivalent of i>iv). Hybridization occurs under conditions 
whereby a specific hybridization complex is formed between said probe and a target polynucleotide 

20 in the biological sample, said target polynucleotide selected from the group consisting of i) a 

polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:33-64, H) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33- 
64, Hi) a polynucleotide complementary to the polynucleotide of i), iv) a polynucleotide 

25 complementary to the polynucleotide of ii), and v) an RNA equivalent of i)-iv). Alternatively, the 
target polynucleotide comprises a fragment of a polynucleotide sequence selected from the group 
consisting of i)-v) above; c) quantifying the amount of hybridization complex; and d) comparing the 
amount of hybridization complex in the treated biological sample with the amount of hybridization 
complex in an untreated biological sample, wherein a difference in the amount of hybridization 

30 complex in the treated biological sample is indicative of toxicity of the test compound. 



BRIEF DESCRIPTION OF THE TABLES 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
sequences of the present invention. 
35 Table 2 shows the GenBank identification number and annotation of the nearest GenBank 
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homolog for polypeptides of the invention. The probability scores for the matches between each 
polypeptide and its homolog(s) are also shown. 

Table 3 shows structural features of polypeptide sequences of the invention, including 
predicted motifs and domains, along with the methods, algorithms, and searchable databases used 
5 for analysis of the polypeptides. 

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble 
polynucleotide sequences of the invention, along with selected fragments of the polynucleotide 
sequences. 

Table 5 shows the representative cDNA library for polynucleotides of the invention. 
10 Table 6 provides an appendix which describes the tissues and vectors used for construction 

of the cDNA libraries shown in Table 5. 

Table 7 shows the tools, programs, and algorithms used to analyze the polynucleotides and 
polypeptides of the invention, along with applicable descriptions, references, and threshold 
parameters. 

15 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is 
understood that this invention is not limited to the particular machines, materials and methods 
described, as these may vary. It is also to be understood that the terminology used herein is for the 

20 purpose of describing particular embodiments only, and is not intended to limit the scope of the 
present invention which will be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," 
"an," and "the'* include plural reference unless the context clearly dictates otherwise. Thus, for 
example, a reference to "a host cell" includes a plurality of such host cells, and a reference to "an 

25 antibody" is a reference to one or more antibodies and equivalents thereof known to those skilled in 
the art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although any machines, materials, and methods similar or equivalent to those described 

30 herein can be used to practice or test the present invention, the preferred machines, materials and 
methods are now described. All publications mentioned herein are cited for the purpose of 
describing and disclosing the cell lines, protocols, reagents and vectors which are reported in the 
publications and which might be used in connection with the invention. Nothing herein is to be 
construed as an admission that the invention is not entitled to antedate such disclosure by virtue of 

35 prior invention. 
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DEFINITIONS 

'TRICH' refers to the amino acid sequences of substantially purified TRICH obtained from 
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, 
and human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 
5 The term "agonist" refers to a molecule which intensifies or mimics the biological activity 

of TRICH. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any 
other compound or composition which modulates the activity of TRICH either by directly 
interacting with TRICH or by acting on components of the biological pathway in which TRICH 
participates. 

10 An "allelic variant" is an alternative form of the gene encoding TRICH. Allelic variants 

may result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs 
or in polypeptides whose structure or function may or may not be altered. A gene may have none, 
one, or many allelic variants of its naturally occurring form. Common mutational changes which 
give rise to allelic variants are generally ascribed to natural deletions, additions, or substitutions of 

15 nucleotides. Each of these types of changes may occur alone, or in combination with the others, one 
or more times in a given sequence. 

"Altered" nucleic acid sequences encoding TRICH include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as TRICH or 
a polypeptide with at least one functional characteristic of TRICH. Included within this definition 

20 are polymorphisms which may or may not be readily detectable using a particular oligonucleotide 
probe of the polynucleotide encoding TRICH, and improper or unexpected hybridization to allelic 
variants, with a locus other than the normal chromosomal locus for the polynucleotide sequence 
encoding TRICH. The encoded protein may also be "altered," and may contain deletions, 
insertions, or substitutions of amino acid residues which produce a silent change and result in a 

25 functionally equivalent TRICH. Deliberate amino acid substitutions may be made on the basis of 
similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues, as long as the biological or immunological activity of TRICH is retained. For 
example, negatively charged amino acids may include aspartic acid and glutamic acid, and 
positively charged amino acids may include lysine and arginine. Amino acids with uncharged polar 

30 side chains having similar hydrophilicity values may include: asparagine and glutamine; and serine 
and threonine. Amino acids with uncharged side chains having similar hydrophilicity values may 
include: leucine, isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or 

35 synthetic molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally 
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occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino 
acid sequence to the complete native amino acid sequence associated with the recited protein 
molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
5 Amplification is generally carried out using polymerase chain reaction (PCR) technologies well 
known in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological 
activity of TRICH. Antagonists may include proteins such as antibodies, nucleic acids, 
carbohydrates, small molecules, or any other compound or composition which modulates the 

10 activity of TRICH either by directly interacting with TRICH or by acting on components of the 
biological pathway in which TRICH participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments 
thereof, such as Fab, F(ab') 2l and Fv fragments, which are capable of binding an epitopic 
determinant. Antibodies that bind TRICH polypeptides can be prepared using intact polypeptides or 

15 using fragments containing small peptides of interest as the immunizing antigen. The polypeptide 
or oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from 
the translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if 
desired. Commonly used carriers that are chemically coupled to peptides include bovine serum 
albumin, thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used 

20 to immunize the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (particular regions or three-dimensional structures 

25 on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
used to elicit the immune response) for binding to an antibody. 

The term "aptamer" refers to a nucleic acid or oligonucleotide molecule that binds to a 
specific molecular target. Aptamers are derived from an in vitro evolutionary process (e.g., SELEX 
(Systematic Evolution of Ligands by Exponential Enrichment), described in U.S. Patent No. 

30 5,270,163), which selects for target-specific aptamer sequences from large combinatorial libraries. 
Aptamer compositions may be double-stranded or single-stranded, and may include 
deoxyribonucleotides, ribonucleotides, nucleotide derivatives, or other nucleotide-like molecules. 
The nucleotide components of an aptamer may have modified sugar groups (e.g., the 2-OH group of 
a ribonucleotide may be replaced by 2-F or 2'-NH 2 ), which may improve a desired property, e.g., 

35 resistance to nucleases or longer lifetime in blood. Aptamers may be conjugated to other molecules, 
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e.g., a high molecular weight carrier to slow clearance of the aptamer from the circulatory system. 
Aptamers may be specifically cross-linked to their cognate ligands, e.g., by photo-activation of a 
cross-linker. (See, e.g., Brody, E.N. and L. Gold (2000) J. Biotechnol. 74:5-13.) 

The term "intramer" refers to an aptamer which is expressed in vivo . For example, a 
5 vaccinia virus-based RNA expression system has been used to express specific RNA aptamers at 
high levels in the cytoplasm of leukocytes (Blind, M. et al. (1999) Proc. Natl Acad. Sci. USA 
96:3606-3610), 

The term "spiegelmer" refers to an aptamer which includes L-DNA, L-RNA, or other left- 
handed nucleotide derivatives or nucleotide-like molecules. Aptamers containing left-handed 
10 nucleotides are resistant to degradation by naturally occurring enzymes, which normally act on 
substrates containing right-handed nucleotides. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; 
RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
15 phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as T-methoxyethyl sugars or 2-methoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5-methyl cytosine, 2'-deoxyuracil, or 7Kteaza-2'-deoxyguanosine. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
20 nucleic acid sequence produced by the cell to form duplexes which block either transcription or 
translation. The designation "negative" or "minus" can refer to the antisense strand, and the 
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 
The terra "biologically active" refers to a protein having structural, regulatory, or 
' biochemical functions of a naturally occurring molecule. Likewise, "immunologically active" or 
25 "immunogenic" refers to the capability of the natural, recombinant, or synthetic TRICH, or of any 
oligopeptide thereof, to induce a specific immune response in appropriate animals or cells and to 
bind with specific antibodies. 

"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5-AGT-3' pairs with its complement, 
30 3'-TCA-5\ 

A "composition comprising a given polynucleotide sequence" and a "composition 
comprising a given amino acid sequence" refer broadly to any composition containing the given 
polynucleotide or amino acid sequence. The composition may comprise a dry formulation or an 
aqueous solution. Compositions comprising polynucleotide sequences encoding TRICH or 
35 fragments of TRICH may be employed as hybridization probes. The probes may be stored in freeze- 
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dried form and may be associated with a stabilizing agent such as a carbohydrate. In hybridizations, 
the probe may be deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., 
sodium dodecyl sulfate; SDS), and other components (e.g., Denhardt's solution, dry milk, salmon 
sperm DNA, etc.). 

5 "Consensus sequence" refers to a nucleic acid sequence which has been subjected to 

repeated DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit 
(Applied Biosystems, Foster City CA) in the 5' and/or the 3* direction, and resequenced, or which 
has been assembled from one or more overlapping cDNA, EST, or genomic DNA fragments using a 
computer program for fragment assembly, such as the GELVDEW fragment assembly system (GCG, 

10 Madison WI) or Phrap (University of Washington, Seattle WA). Some sequences have been both 
extended and assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of 
the protein is conserved and not significantly changed by such substitutions. The table below shows 

15 amino acids which may be substituted for an original amino acid in a protein and which are 
regarded as conservative amino acid substitutions. 





Original Residue 


Conservative Substitution 




Ala 


Gly, Ser 




Arg 


His, Lys 


20 


Asn 


Asp, Gin, His 




Asp 


Asn, Glu 




Cys 


Ala, Ser 




Gin 


Asn, Glu, His 




Glu 


Asp, Gin, His 


25 


Gly 


Ala 




His 


Asn, Arg, Gin, Glu 




Ee 


Leu, Val 




Leu 


lie, Val 




Lys 


Arg, Gin, Glu 


30 


Met 


Leu, lie 




Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 




Thr 


Ser, Val 




Tip 


Phe, Tyr 


35 


Tyr 


His, Phe, Trp 




Val 


He, Leu, Thr 



Conservative amino acid substitutions generally maintain (a) the structure of the 
polypeptide backbone in the area of the substitution, for example, as a beta sheet or alpha helical 
40 conformation, (b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or 
(c) the bulk of the side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
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absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide can include, for example, replacement of hydrogen by 
an alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which 

5 retains at least one biological or immunological function of the natural molecule. A derivative 
polypeptide is one modified by glycosylation, pegylation, or any similar process that retains at least 
one biological or immunological function of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

10 "Differential expression" refers to increased or upregulated; or decreased, downregulated, 

. or absent gene or protein expression, determined by comparing at least two different samples. Such 
comparisons may be carried out between, for example, a treated and an untreated sample, or a 
diseased and a normal sample. 

"Exon shuffling" refers to the recombination of different coding regions (exons). Since an 

15 exon may represent a structural or functional domain of the encoded protein, new proteins may be 
assembled through the novel reassortment of stable substructures, thus allowing acceleration of the 
evolution of new protein functions. 

A "fragment" is a unique portion of TRICH or the polynucleotide encoding TRICH which is 
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise 

20 up to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For 
example, a fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. 
A fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at 
least 5, 10, 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or 
amino acid residues in length. Fragments may be preferentially selected from certain regions of a 

25 molecule. For example, a polypeptide fragment may comprise a certain length of contiguous amino 
acids selected from the first 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as 
shown in a certain defined sequence. Clearly these lengths are exemplary, and any length that is 
supported by the specification, including the Sequence Listing, tables, and figures, may be 
encompassed by the present embodiments. 

30 A fragment of SEQ ED NO: 33-64 comprises a region of unique polynucleotide sequence 

that specifically identifies SEQ ID NO:33-64, for example, as distinct from any other sequence in 
the genome from which the fragment was obtained. A fragment of SEQ ID NO:33-64 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish 
SEQ ID NO:33-64 from related polynucleotide sequences. The precise length of a fragment of SEQ 

35 ID NO:33-64 and the region of SEQ ID NO:33-64 to which the fragment corresponds are routinely 
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determinable by one of ordinary skill in the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO: 1-32 is encoded by a fragment of SEQ ID NO:33-64. A 
fragment of SEQ ED NO: 1-32 comprises a region of unique amino acid sequence that specifically 
identifies SEQ ID NO: 1-32. For example, a fragment of SEQ ID NO: 1-32 is useful as an 

5 immunogenic peptide for the development of antibodies that specifically recognize SEQ ID NO: 1- 
32. The precise length of a fragment of SEQ ID NO: 1-32 and the region of SEQ ID NO: 1-32 to 
which the fragment corresponds are routinely determinable by one of ordinary skill in the art based 
on the intended purpose for the fragment. 

A "full length" polynucleotide sequence is one containing at least a translation initiation 

10 codon (e.g., methionine) followed by an open reading frame and a translation termination codon. A 
"full length" polynucleotide sequence encodes a "full length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between 
two or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer 

15 to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps 
in the sequences being compared in order to optimize alignment between two sequences, and 
therefore achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 

20 parameters of the CLUST AL V algorithm as incorporated into the MEGALIGN version 3 . 1 2e 

sequence alignment program. This program is part of the LASERGENE software package, a suite of 
molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in 
Higgins, D-G. and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) 
CABIOS 8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters 

25 are set as follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" 
residue weight table is selected as the default. Percent identity is reported by CLUSTAL V as the 
"percent similarity" between aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison 
algorithms is provided by the National Center for Biotechnology Information (NCB1) Basic Local 

30 Alignment Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is 
available from several sources, including the NCBI, Bethesda, MD, and on the Internet at 
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence 
analysis programs including "blastn," that is used to align a known polynucleotide sequence with 
other polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 

35 2 Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
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Sequences" can be accessed and used interactively at http://www.ncbi.nlrn.nih.gov/gorf/bl2.html. 
The "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool 
5 Version 2.0.12 (April-2 1-2000) set at default parameters. Such default parameters may be, for 
example: 

Matrix: BLOSUM62 

Reward for match: 1 

Penalty for mismatch: -2 
10 Open Gap: 5 and Extension Gap: 2 penalties 

Gap x drop-off : 50 

Expect: 10 

Word Size: 11 

Filter: on 

15 Percent identity may be measured over the length of an entire defined sequence, for 

example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined sequence, for instance, a 
fragment of at least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 
contiguous nucleotides. Such lengths are exemplary only, and it is understood that any fragment 

20 length supported by the sequences shown herein, in the tables, figures, or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that 
changes in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic 

25 acid sequences that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer 
to the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm Methods of polypeptide sequence alignment are well-known. Some 
alignment methods take into account conservative amino acid substitutions. Such conservative 

30 substitutions, explained in more detail above, generally preserve the charge and.hydrophobicity at 
the site of substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program (described and referenced above). For pairwise alignments of 

35 polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap 
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penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
residue weight table. As with polynucleotide alignments, the percent identity is reported by 
CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
5 comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0. 12 (April-21-2000) with blastp set at default parameters. Such default parameters may be, for 
example: 

Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
10 Gap x drop-off : 50 

Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 

15 for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, 
for example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at 
least 150 contiguous residues. Such lengths are exemplary only, and it is understood that any 
fragment length supported by the sequences shown herein, in the tables, figures or Sequence 

20 Listing, may be used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. j 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 

25 sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of 

30 complementarity. Specific hybridization complexes form under permissive annealing conditions 
and remain hybridized after the "washing" step(s). The washing step(s) is particularly important in 
determining the stringency of the hybridization process, with more stringent conditions allowing 
less non-specific binding, i.e., binding between pairs of nucleic acid strands that are not perfectly 
matched. Permissive conditions for annealing of nucleic acid sequences are routinely determinable 

35 by one of ordinary skill in the art and may be consistent among hybridization experiments, whereas 
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wash conditions may be varied among experiments to achieve the desired stringency, and therefore 
hybridization specificity. Permissive annealing conditions occur, for example, at 68°C in the 
presence of about 6 x SSC, about 1% (w/v) SDS, and about 100 jig/ml sheared, denatured salmon 
sperm DNA. 

5 Generally, stringency of hybridization is expressed, in part, with reference to the 

temperature under which the wash step is carried out. Such wash temperatures are typically 
selected to be about 5°C to 20°C lower than the thermal melting point (TJ for the specific sequence 
at a defined ionic strength and pH. The T m is the temperature (under defined ionic strength and pH) 
at which 50% of the target sequence hybridizes to a perfectly matched probe. An equation for 

10 calculating T m and conditions for nucleic acid hybridization are well known and can be found in 
Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual . 2 nd ed., vol. 1-3, Cold Spring 
Harbor Press, Plainview NY; specifically see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present 
invention include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, 

15 for 1 hour. Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC 
concentration may be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. 
Typically, blocking reagents are used to block non-specific hybridization. Such blocking reagents 
include, for instance, sheared and denatured salmon sperm DNA at about 100-200 pig/ml. Organic 
solvent, such as formamide at a concentration of about 35-50% v/v, may also be used under 

20 particular circumstances, such as for RNA:DNA hybridizations. Useful variations on these wash 
conditions will be readily apparent to those of ordinary skill in the art. Hybridization, particularly 
under high stringency conditions, may be suggestive of evolutionary similarity between the 
nucleotides. Such similarity is strongly indicative of a similar role for the nucleotides and their 
encoded polypeptides. 

25 The term "hybridization complex" refers to a complex formed between two nucleic acid 

sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g., C 0 t or R 0 t analysis) or formed between one 
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate 

30 substrate to which cells or their nucleic acids have been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 

35 of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
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cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of TRICH which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide 
5 fragment of TRICH which is useful in any of the antibody production methods disclosed herein or 
known in the art. 

The term "microarray" refers to an arrangement of a plurality of polynucleotides, 
polypeptides, or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
10 chemical compound having a unique and defined position on a microarray. 

The term "modulate" refers to a change in the activity of TRICH. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other 
biological, functional, or immunological properties of TRICH. 

The phrases 4 *nucleic acid" and "nucleic acid sequence" refer to a nucleotide, 
15 oligonucleotide, polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA 
of genomic or synthetic origin which may be single-stranded or double-stranded and may represent 
the sense or the antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like 
material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
20 functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
25 comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone 
of amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript 
elongation, and may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an TRICH may involve lipidation, glycosylation, 
30 phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in 
the art. These processes may occur synthetically or biochemically. Biochemical modifications will 
vary by cell type depending on the enzymatic milieu of TRICH. 

"Probe" refers to nucleic acid sequences encoding TRICH, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
35 isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 
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Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. 
"Primers" are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target 
polynucleotide by complementary base-pairing. The primer may then be extended along the target 
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 

5 identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PGR). 

Probes and primers as used in the present invention typically comprise at least 15 
contiguous nucleotides of a known sequence. In order to enhance specificity, longer probes and 
primers may also be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 
60, 70, 80, 90, 100, or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. 

10 Probes and primers may be considerably longer than these examples, and it is understood that any 
length supported by the specification, including the tables, figures, and Sequence Listing, may be 
used. 

Methods for preparing and using probes and primers are described in the references, for 
example Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual . 2 nd ed., vol. 1-3, Cold 

15 Spring Harbor Press, Plainview NY; Ausubel, F.M. et al. (1987) Current Protocols in Molecular 
Biology , Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al. (1990) PGR 
Protocols. A Guide to Methods and Applications . Academic Press, San Diego CA. PCR primer 
pairs can be derived from a known sequence, for example, by using computer programs intended for 
that purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, 

20 Cambridge MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 
5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer 

25 selection programs have incorporated additional features for expanded capabilities. For example, 
the PrimOU primer selection program (available to the public from the Genome Center at University 
of Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from 
megabase sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 
primer selection program (available to the public from the Whitehead Institute/MIT Center for 

30 Genome Research, Cambridge MA) allows the user to input a "misprinting library," in which 
sequences to avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for 
the selection of oligonucleotides for microarrays. (The source code for the latter two primer 
selection programs may also be obtained from their respective sources and modified to meet the 
user's specific needs.) The PrimeGen program (available to the public from the UK Human 

35 Genome Mapping Project Resource Centre, Cambridge UK) designs primers based on multiple 
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sequence alignments, thereby allowing selection of primers that hybridize to either the most 
conserved or least conserved regions of aligned nucleic acid sequences. Hence, this program is 
useful for identification of both unique and conserved oligonucleotides and polynucleotide 
fragments. The oligonucleotides and polynucleotide fragments identified by any of the above 

5 selection methods are useful in hybridization technologies, for example, as PCR or sequencing 
primers, microarray elements, or specific probes to identify fully or partially complementary 
polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited 
to those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 

10 that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, 

15 a recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter 
sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to 
transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
20 expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5' and 3* untranslated regions 
(UTRs). Regulatory elements interact with host or viral proteins which control transcription, 
translation, or RNA stability. 
25 "Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 

amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art. 

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 
30 sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of 
the nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of 
ribose instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing TRICH, 
nucleic acids encoding TRICH, or fragments thereof may comprise a bodily fluid; an extract from a 
35 cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or 
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cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular 

5 structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding 
molecule. For example, if an antibody is specific for epitope "A, M the presence of a polypeptide 
comprising the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled 
A and the antibody will reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 

10 removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components with 
which they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides 
by different amino acid residues or nucleotides, respectively. 

15 "Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 

chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" or "expression profile" refers to the collective pattern of gene 

20 expression by a particular cell type or tissue under given conditions at a given time. 

"Transformation" describes a process by which exogenous DNA is introduced into a 
recipient cell. Transformation may occur under natural or artificial conditions according to various 
methods well known in the art, and may rely on any known method for the insertion of foreign 
nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method for transformation is 

25 selected based on the type of host cell being transformed and may include, but is not limited to, 
bacteriophage or viral infection, electroporation, heat shock, lipofection, and particle bombardment 
The term "transformed cells" includes stably transformed cells in which the inserted DNA is 
capable of replication either as an autonomously replicating plasmid or as part of the host 
chromosome, as well as transiently transformed cells which express the inserted DNA or RNA for 

30 limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to 
animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques well known in the 
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a 

35 precursor of the cell, by way of deliberate genetic manipulation, such as by microinjection or by 
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infection with a recombinant virus. The term genetic manipulation does not include classical 
cross-breeding, or in vitro fertilization, but rather is directed to the introduction of a recombinant 
DNA molecule. The transgenic organisms contemplated in accordance with the present invention 
include bacteria, cyanobacteria, fungi, plants and animals. The isolated DNA of the present 
5 invention can be introduced into the host by methods known in the art, for example infection, 
transfection, transformation or transconjugation. Techniques for transferring the DNA of the 
present invention into such organisms are widely known and provided in references such as 
Sambrook et al. (1989), supra . 

A 'Variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence 

10 having at least 40% sequence identity to the particular nucleic acid sequence over a certain length of 
one of the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 
(May-07-1999) set at default parameters. Such a pair of nucleic acids may show, for example, at 
least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, or at least 98%, or greater 

15 sequence identity over a certain defined length. A variant may be described as, for example, an 
"allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may 
have significant identity to a reference molecule, but will generally have a greater or lesser number 
of polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding 
polypeptide may possess additional functional domains or lack domains that are present in the 

20 reference molecule. Species variants are polynucleotide sequences that vary from one species to 
another. The resulting polypeptides will generally have significant amino acid identity relative to 
each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene 
between individuals of a given species. Polymorphic variants also may encompass "single 
nucleotide polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide 

25 base. The presence of SNPs may be indicative of, for example, a certain population, a disease state, 
or a propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence 
having at least 40% sequence identity to the particular polypeptide sequence over a certain length of 
one of the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 

30 (May-07-1999) set at default parameters. Such a pair of polypeptides may show, for example, at 
least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 91%, at least 92%, at least 
93%, at least 94%, at least 95%, at least 96%, at least 97%, or at least 98%, or greater sequence 
identity over a certain defined length of one of the polypeptides. 

35 THE INVENTION 
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The invention is based on the discovery of new human transporters and ion channels 
(TRICH), the polynucleotides encoding TRICH, and the use of these compositions for the diagnosis, 
treatment, or prevention of transport, neurological, muscle, immunological and cell proliferative 
disorders. 

5 Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 

sequences of the invention. Each polynucleotide and its corresponding polypeptide are correlated to 
a single Incyte project identification number (Incyte Project ID). Each polypeptide sequence is 
denoted by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an 
Incyte polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide 

10 sequence is denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ 
ID NO:) and an Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as 
shown. 

Table 2 shows sequences with homology to the polypeptides of the invention as identified 
by BLAST analysis against the GenBank protein (genpept) database. Columns 1 and 2 show the 

15 polypeptide sequence identification number (Polypeptide SEQ ID NO:) and the corresponding 
Incyte polypeptide sequence number (Incyte Polypeptide ID) for polypeptides of the invention. 
Column 3 shows the GenBank identification number (GenBank ID NO:) of the nearest GenBank 
homolog. Column 4 shows the probability scores for the matches between each polypeptide and its 
homolog(s). Column 5 shows the annotation of the GenBank homolog(s) along with relevant 

20 citations where applicable, all of which are expressly incorporated by reference herein. 

Table 3 shows various structural features of the polypeptides of the invention. Columns 1 
and 2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding 
Incyte polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention. 
Column 3 shows the number of amino acid residues in each polypeptide. Column 4 shows potential 

25 phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the 
MOTIFS program of the GCG sequence analysis software package (Genetics Computer Group, 
Madison WI). Column 6 shows amino acid residues comprising signature sequences, domains, and 
motifs. Column 7 shows analytical methods for protein structure/function analysis and in some 
cases, searchable databases to which the analytical methods were applied. 

30 Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and 

these properties establish that the claimed polypeptides are transporters and ion channels. For 
example, SEQ ID NO:l is 83% identical to human sodium-hydrogen exchanger 6 (GenBank ID 
g2944233) as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The 
BLAST probability score is S.le-242, which indicates the probability of obtaining the observed 

35 polypeptide sequence alignment by chance. SEQ ID NO: 1 also contains a sodium/hydrogen 
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exchanger family domain as determined by searching for statistically significant matches in the 
hidden Markov model (HMM)-based PFAM database of conserved protein family domains. (See 
Table 3.) Data from BLIMPS, and other BLAST analyses provide further corroborative evidence 
that SEQ ID NO:l is a sodium-hydrogen exchange transporter. In another example, SEQ ID NO:7 
5 is 85% identical to Rattus norvegicus Na7,K + -ATPase alpha subunit (GenBarik ID g619915) as 
determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST 
probability score is 0.0, which indicates the probability of obtaining the observed polypeptide 
sequence alignment by chance. SEQ ID NO:7 also contains an E1-E2 ATPase domain as 
determined by searching for statistically significant matches in the hidden Markov model (HMM)- 

10 based PFAM database of conserved protein family domains. (See Table 3.) Data from BLIMPS, 
MOTIFS, and PROFILESCAN analyses provide further corroborative evidence that SEQ ID NO:7 
is a cation-transporting ATPase. In yet another example, SEQ ID NO: 13 is 77% identical to a 
human carrier-like protein (GenBank ID g3694661) as determined by the Basic Local Alignment 
Search Tool (BLAST). (See Table 2.) The BLAST probability score is 5.5e-209, which indicates 

15 the probability of obtaining the observed polypeptide sequence alignment by chance. SEQ ID 

NO: 13 also contains a mitochondrial energy transfer protein domain as determined by searching for 
statistically significant matches in the hidden Markov model (HMM>based PFAM database of 
conserved protein family domains. (See Table 3.) The presence of this domain is confirmed by 
BLIMPS, MOTIFS, and PROFILESCAN analyses, providing further corroborative evidence that 

20 SEQ ID NO: 13 is a transporter. Further, SEQ ID NO: 16 is 41% identical to human novel ATPase 
(GenBank ID g8979801) as determined by the Basic Local Alignment Search Tool (BLAST). (See 
Table 2.) The BLAST probability score is Lle-165, which indicates the probability of obtaining the 
observed polypeptide sequence alignment by chance. SEQ ID NO: 16 also contains an E1-E2 
ATPase domain as determined by searching for statistically significant matches in the hidden 

25 Markov model (HMM)-based PFAM database of conserved protein family domains. (See Table 3.) 
Data from BLIMPS, MOTIFS, and PROFILESCAN analyses provide further corroborative evidence 
that SEQ ID NO: 16 is a cation-transporting ATPase. In a further example, SEQ ID NO: 19 is 43% 
identical to Sinorhizobium sp. As4 ArsA, the catalytic subunit of the arsenic oxyanion-translocating 
ATPase (GenBank ID g5802945) as determined by the Basic Local Alignment Search Tool 

30 (BLAST). (See Table 2.) The BLAST probability score is 7.7e-125, which indicates the probability 
of obtaining the observed polypeptide sequence alignment by chance. SEQ ID NO: 19 also contains 
an anion-transporting ATPase domain as determined by searching for statistically significant 
matches in the hidden Markov model (HMM)-based PFAM database of conserved protein family 
domains. (See Table 3.) Data from additional BLAST and PROFILESCAN analyses provide 

35 further corroborative evidence that SEQ ID NO: 19 is an anion-transporting ATPase. In yet a further 
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example, SEQ ID NO:21 is 54% identical to a murine putative E1-E2 ATPase (GenBank ID 
g28577) as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The 
BLAST probability score is 5.2e-190, which indicates the probability of obtaining the observed 
polypeptide sequence alignment by chance. SEQ ID NO:21 also contains six transmembrane 

5 domains as determined using TMAP, a program which delineates transmembrane segments. (See 
Table 3.) Data from BLIMPS, and MOTIFS, analyses provide further corroborative evidence that 
SEQ ID NO:21 is an ATPase. In a futher example, SEQ ID NO:24 is 52% identical, from residue 
A77 to residue L1007, to rat NMDAR-L (GenBank ID g2160125) as determined by the Basic Local 
Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is L5e-262, which 

10 indicates the probability of obtaining the observed polypeptide sequence alignment by chance. SEQ 
ID NO: 24 also contains a ligand gated ion channel domain as determined by searching for 
statistically significant matches in the hidden Markov model (HMM)-based PFAM database of 
conserved protein family domains. (See Table 3.) Data from BLIMPS analysis provide further 
corroborative evidence that SEQ ID NO:24 is a glutamate receptor. SEQ ID NO:2-6, SEQ ID NO:8- 

15 12, SEQ ID NO:14-15, SEQ ID NO: 17-18, SEQ ID NO:20, SEQ ID NO:22-23, and SEQ ID NO:25- 
32 were analyzed and annotated in a similar manner. The algorithms and parameters for the 
analysis of SEQ ID NO: 1-32 are described in Table 7. 

As shown in Table 4, the full length polynucleotide sequences of the present invention were 
assembled using cDNA sequences or coding (exon) sequences derived from genomic DNA, or any 

20 combination of these two types of sequences. Column 1 lists the polynucleotide sequence 
identification number (Polynucleotide SEQ ID NO:), the corresponding Incyte polynucleotide 
consensus sequence number (Incyte ID) for each polynucleotide of the invention, and the length of 
each polynucleotide sequence in basepairs. Column 2 shows the nucleotide start (5') and stop (3') 
positions of the cDNA and/or genomic sequences used to assemble the full length polynucleotide 

25 sequences of the invention, and of fragments of the polynucleotide sequences which are useful, for 
example, in hybridization or amplification technologies that identify SEQ ID NO:33-64 or that 
distinguish between SEQ ID NO:33-64 and related polynucleotide sequences. 

The polynucleotide fragments described in Column 2 of Table 4 may refer specifically, for 
example, to Incyte cDNAs derived from tissue-specific cDNA libraries or from pooled cDNA 

30 libraries. Alternatively, the polynucleotide fragments described in column 2 may refer to GenBank 
cDNAs or ESTs which contributed to the assembly of the full length polynucleotide sequences. In 
addition, the polynucleotide fragments described in column 2 may identify sequences derived from 
the ENSEMBL (The Sanger Centre, Cambridge, UK) database (Le. 9 those sequences including the 
designation "ENST"). Alternatively, the polynucleotide fragments described in column 2 may be 

35 derived from the NCBI RefSeq Nucleotide Sequence Records Database (i.e M those sequences 
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including the designation "NM" or "NT") or the NCBI RefSeq Protein Sequence Records (i.e., 
those sequences including the designation "NP"), Alternatively, the polynucleotide fragments 
described in column 2 may refer to assemblages of both cDNA and Genscan-predicted exons 
brought together by an "exon stitching" algorithm. For example, a polynucleotide sequence 

5 identified as FL JCXXXXXJJjJJ^YYYYYJf^ represents a "stitched" sequence in which 
XXXXXX is the identification number of the cluster of sequences to which the algorithm was 
applied, and YYYYY is the number of the prediction generated by the algorithm, and N iA$mm , if 
present, represent specific exons that may have been manually edited during analysis (See Example 
V), Alternatively, the polynucleotide fragments in column 2 may refer to assemblages of exons 

10 brought together by an "exon-stretching" algorithm. For example, a polynucleotide sequence 
identified as FLXXXXXX_gAAAAA_gBBBBB_l JV is a "stretched" sequence, with XXXXXX being 
the Incyte project identification number, gAAAAA being the GenBank identification number of the 
human genomic sequence to which the "exon-stretching" algorithm was applied, gBBBBB being the 
GenBank identification number or NCBI RefSeq identification number of the nearest GenBank 

15 protein homolog, and N referring to specific exons (See Example V). In instances where a RefSeq 
sequence was used as a protein homolog for the "exon-stretching" algorithm, a RefSeq identifier 
(denoted by "NM," "NP," or "NT") may be used in place of the GenBank identifier (i.e., gBBBBB). 

Alternatively, a prefix identifies component sequences that were hand-edited, predicted 
from genomic DNA sequences, or derived from a combination of sequence analysis methods. The 

20 following Table lists examples of component sequence prefixes and corresponding sequence 
analysis methods associated with the prefixes (see Example IV and Example V). 



Prefix 


Type of analysis and/or examples of programs 


GNN, GFG, 
ENST 


Exon prediction from genomic sequences using, for example, 
GENSCAN (Stanford University, CA, USA) or FGENES 
(Computer Genomics Group, The Sanger Centre, Cambridge, UK). 


GBI 


Hand-edited analysis of genomic sequences. 


FL 


Stitched or stretched genomic sequences (see Example V). 


INCY 


Full length transcript and exon prediction from mapping of EST 
sequences to the genome. Genomic location and EST composition 
data are combined to predict the exons and resulting transcript. 



In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in 
30 Table 4 was obtained to confirm the final consensus polynucleotide sequence, but the relevant 
Incyte cDNA identification numbers are not shown. 

Table 5 shows the representative cDNA libraries for those full length polynucleotide 
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sequences which were assembled using Incyte cDNA sequences. The representative cDNA library 
is the Incyte cDNA library which is most frequently represented by the Incyte cDNA sequences 
which were used to assemble and confirm the above polynucleotide sequences. The tissues and 
vectors which were used to construct the cDNA libraries shown in Table 5 are described in Table 6. 

5 The invention also encompasses TRICH variants. A preferred TRICH variant is one which 

has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the TRICH amino acid sequence, and which contains at least one functional or 
structural characteristic of TRICH. 

The invention also encompasses polynucleotides which encode TRICH. In a particular 

10 embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ID NO:33-64, which encodes TRICH. The polynucleotide 
sequences of SEQ ID NO:33-64, as presented in the Sequence Listing, embrace the equivalent RNA 
sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the 
sugar backbone is composed of ribose instead of deoxyribose. 

15 The invention also encompasses a variant of a polynucleotide sequence encoding TRICH. 

In particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at 
least about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide 
sequence encoding TRICH. A particular aspect of the invention encompasses a variant of a 
polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID 

20 NO:33-64 which has at least about 70%, or alternatively at least about 85%, or even at least about 
95% polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting 
of SEQ ID NO:33-64. Any one of the polynucleotide variants described above can encode an amino 
acid sequence which contains at least one functional or structural characteristic of TRICH. 

In addition, or in the alternative, a polynucleotide variant of the invention is a splice variant 

25 of a polynucleotide sequence encoding TRICH. A splice variant may have portions which have 
significant sequence identity to the polynucleotide sequence encoding TRICH, but will generally 
have a greater or lesser number of polynucleotides due to additions or deletions of blocks of 
sequence arising from alternate splicing of exons during mRNA processing. A splice variant may 
have less than about 70%, or alternatively less than about 60%, or alternatively less than about 50% 

30 polynucleotide sequence identity to the polynucleotide sequence encoding TRICH over its entire 
length; however, portions of the splice variant will have at least about 70%, or alternatively at least 
about 85%, or alternatively at least about 95%, or alternatively 100% polynucleotide sequence 
identity to portions of the polynucleotide sequence encoding TRICH. Any one of the splice variants 
described above can encode an amino acid sequence which contains at least one functional or 

35 structural characteristic of TRICH. 
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It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding TRICH, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 

5 sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring TRICH, and all such variations are to be considered 
as being specifically disclosed. 

Although nucleotide sequences which encode TRICH and its variants are generally capable 

10 of hybridizing to the nucleotide sequence of the naturally occurring TRICH under appropriately 
selected conditions of stringency, it may be advantageous to produce nucleotide sequences encoding 
TRICH or its derivatives possessing a substantially different codon usage, e.g., inclusion of non- 
naturally occurring codons. Codons may be selected to increase the rate at which expression of the 
peptide occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with 

15 which particular codons are utilized by the host. Other reasons for substantially altering the 

nucleotide sequence encoding TRICH and its derivatives without altering the encoded amino acid 
sequences include the production of RNA transcripts having more desirable properties, such as a 
greater half-life, than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode TRICH and 

20 TRICH derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a sequence encoding TRICH or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 

25 hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:33-64 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 
152:507-511.) Hybridization conditions, including annealing and wash conditions, are described in 
'Definitions." 

30 Methods for DNA sequencing are well known in the art and may be used to practice any of 

the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 
Biosystems), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 
combinations of polymerases and proofreading exonucleases such as those found in the 

35 ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence 
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preparation is automated with machines such as the MICROLAB 2200 liquid transfer system 
(Hamilton, Reno NV), PTC200 thermal cycler (MJ Research, Watertown MA) and ABI 
CATALYST 800 thermal cycler (Applied Biosystems). Sequencing is then carried out using either 
the ABI 373 or 377 DNA sequencing system (Applied Biosystems), the MEGABACE 1000 DNA 
5 sequencing system (Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The 
resulting sequences are analyzed using a variety of algorithms which are well known in the art. 
(See, e.g., Ausubel, F.M. (1997) Short Protocols in Molecular Biology , John Wiley & Sons, New 
York NY, unit 7.7; Meyers, R.A. (1995) Molecular Biology and Biotechnology . Wiley VCH, New 
York NY, pp. 856-853.) 

10 The nucleic acid sequences encoding TRIOS may be extended utilizing a partial nucleotide 

sequence and employing various PCR-based methods known in the art to detect upstream 
sequences, such as promoters and regulatory elements. For example, one method which may be 
employed, restriction-site PCR, uses universal and nested primers to amplify unknown sequence 
from genomic DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 

15 2:3 18-322.) Another method, inverse PCR, uses primers that extend in divergent directions to 
amplify unknown sequence from a circularized template. The template is derived from restriction 
fragments comprising a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et 
al. (1988) Nucleic Acids Res. 16:8186.) A third method, capture PCR, involves PCR amplification 
of DNA fragments adjacent to known sequences in human and yeast artificial chromosome DNA. 

20 (See,e.g.,Lagerstrom,M.etal. (1991) PCR Methods Applic. 1:111-119.) In this method, multiple 
restriction enzyme digestions and ligations may be used to insert an engineered double-stranded 
sequence into a region of unknown sequence before performing PCR. Other methods which may be 
used to retrieve unknown sequences are known in the art. (See, e.g., Parker, J.D. et al. (1991) 
Nucleic Acids Res. 19:3055-3060). Additionally, one may use PCR/nested primers, and 

25 PROMOTERFINDER libraries (Clontech, Palo Alto CA) to walk genomic DNA. This procedure 
avoids the need to screen libraries and is useful in finding intron/exon junctions. For all PCR-based 
methods, primers may be designed using commercially available software, such as OLIGO 4.06 
primer analysis software (National Biosciences, Plymouth MN) or another appropriate program, to 
be about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal 

30 to the template at temperatures of about 68°C to 72°C. 

When screening for full length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of 

35 sequence into 5' non-transcribed regulatory regions. 
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Capillary electrophoresis systems which are commercially available may be used to analyze 
the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 

5 emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular sample. 

10 In another embodiment of the invention, polynucleotide sequences or fragments thereof 

which encode TRICH may be cloned in recombinant DNA molecules that direct expression of 
TRICH, or fragments or functional equivalents thereof, in appropriate host cells. Due to the 
inherent degeneracy of the genetic code, other DNA sequences which encode substantially the same 
or a functionally equivalent amino acid sequence may be produced and used to express TRICH. 

15 The nucleotide sequences of the present invention can be engineered using methods 

generally known in the art in order to alter TRICH-encoding sequences for a variety of purposes 
including, but not limited to, modification of the cloning, processing, and/or expression of the gene 
product. DNA shuffling by random fragmentation and PCR reassembly of gene fragments and 
synthetic oligonucleotides may be used to engineer the nucleotide sequences. For example, 

20 oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create 
new restriction sites, alter glycosylation patterns, change codon preference, produce splice variants, 
and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques 
such as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent No. 

25 5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C et al. (1999) 
Nat. Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat Biotechnol. 14:315-319) to alter or 
improve the biological properties of TRICH, such as its biological or enzymatic activity or its 
ability to bind to other molecules or compounds. DNA shuffling is a process by which a library of 
gene variants is produced using PCR-mediated recombination of gene fragments. The library is 

30 then subjected to selection or screening procedures that identify those gene variants with the desired 
properties. These preferred variants may then be pooled and further subjected to recursive rounds 
of DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
breeding and rapid molecular evolution. For example, fragments of a single gene containing 
random point mutations may be recombined, screened, and then reshuffled until the desired 

35 properties are optimized. Alternatively, fragments of a given gene may be recombined with 
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fragments of homologous genes in the same gene family, either from the same or different species, 
thereby maximizing the genetic diversity of multiple naturally occurring genes in a directed and 
controllable manner. 

In another embodiment, sequences encoding TRICH may be synthesized, in whole or in 
5 part, using chemical methods well known in the art. (See, e.g., Caruthers, MH. et al. (1980) 
Nucleic Acids Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 
7:225-232.) Alternatively, TRICH itself or a fragment thereof may be synthesized using chemical 
methods. For example, peptide synthesis can be performed using various solution-phase or 
solid-phase techniques. (See, e.g., Creighton, T. (1984) Proteins. Structures and Molecular 
10 Properties . WH Freeman, New York NY, pp. 55-60; and Roberge, J.Y. et al. (1995) Science 
269:202-204.) Automated synthesis may be achieved using the ABI 431 A peptide synthesizer 
(Applied Biosy stems). Additionally, the amino acid sequence of TRICH, or any part thereof, may 
be altered during direct synthesis and/or combined with sequences from other proteins, or any part 
thereof, to produce a variant polypeptide or a polypeptide having a sequence of a naturally occurring 
15 polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods EnzymoL 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by 
sequencing. (See, e.g., Creighton, supra , pp. 28-53.) 

20 In order to express a biologically active TRICH, the nucleotide sequences encoding TRICH 

or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which 
contains the necessary elements for transcriptional and translational control of the inserted coding 
sequence in a suitable host. These elements include regulatory sequences, such as enhancers, 
constitutive and inducible promoters, and 5' and 3* untranslated regions in the vector and in 

25 polynucleotide sequences encoding TRICH. Such elements may vary in their strength and 
specificity. Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding TRICH. Such signals include the ATG initiation codon and adjacent 
sequences, e.g. the Kozak sequence. In cases where sequences encoding TRICH and its initiation 
codon and upstream regulatory sequences are inserted into the appropriate expression vector, no 

30 additional transcriptional or translational control signals may be needed. However, in cases where 
only coding sequence, or a fragment thereof, is inserted, exogenous translational control signals 
including an in-frame ATG initiation codon should be provided by the vector. Exogenous 
translational elements and initiation codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of enhancers appropriate for the 

35 particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 
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20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct 
expression vectors containing sequences encoding TRICH and appropriate transcriptional and 
translation^ control elements. These methods include in vitro recombinant DNA techniques, 

5 synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) 
Molecular Cloning. A Laboratory Manual Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 
16-17; Ausubel, F.M. et al. (1995) Current Protoc ols in Molecular Biology. John Wiley & Sons, 
New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express 

10 sequences encoding TRICH. These include, but are not limited to, microorganisms such as bacteria 
transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast 
transformed with yeast expression vectors; insect cell systems infected with viral expression vectors 
(e.g., baculovirus); plant cell systems transformed with viral expression vectors (e.g., cauliflower 
mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti 

15 or pBR322 plasmids); or animal cell systems. (See, e.g., Sambrook, supra : Ausubel, supya; Van 
Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Engelhardi E.K. et al, (1994) 
Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; 
Takamatsu, N. (1987) EMBO J. 6:307-311; The McGraw Hill Yearbook of Science and Technology 
(1992) McGraw Hill, New York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Natl Acad. 

20 Sci. USA 81:3655-3659; and Harrington, LJ. et al. (1997) Nat. Genet. 15:345-355.) Expression 
vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from various 
bacterial plasmids, may be used for delivery of nucleotide sequences to the targeted organ, tissue, or 
cell population. (See, e.g., Di Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et 
al. (1993) Proc. Nad. Acad. Sci. USA 90(13):6340-6344; Buller, R.M. et al. (1985) Nature 

25 317(6040):813-815; McGregor, D.P. et al. (1994) Mol. Immunol 31(3):219-226; and Verma, I.M. 
and N. Somia (1997) Nature 389:239-242.) The invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding TRICH. For example, routine 
cloning, subcloning, and propagation of polynucleotide sequences encoding TRICH can be achieved 

30 using a multifunctional E. coli vector such as PBLUESCRJDPT (Stratagene, La Jolla CA) or 

PSPORT1 plasmid (Life Technologies). Ligation of sequences encoding TRICH into the vector's 
multiple cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure for 
identification of transformed bacteria containing recombinant molecules. In addition, these vectors 
may be useful for in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, 

35 and creation of nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. 
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Schuster (1989) J. Biol. Chem. 264:5503-5509.) When large quantities of TRICHare needed, e.g. 
for the production of antibodies, vectors which direct high level expression of TRICH may be used. 
For example, vectors containing the strong, inducible SP6 or T7 bacteriophage promoter may be 
used. 

5 Yeast expression systems may be used for production of TRICH. A number of vectors 

containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such 
vectors direct either the secretion or intracellular retention of expressed proteins and enable 
integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 

10 1995, supra ; Bitter, G.A et al. (1987) Methods Enzymol. 153:516-544; and Scorer, C.A. et al. 
(1994) Bio/Technology 12:181-184.) 

Plant systems may also be used for expression of TRICH. Transcription of sequences 
encoding TRICH may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO 

15 J. 6:307-3 1 1). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock 
promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et 
al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.) 
These constructs can be introduced into plant cells by direct DNA transformation or 
^pathogen-mediated transfection. (See, e.g., The McGraw Hill Yearbook of Science and Technology 

20 (1992) McGraw Hill, New York NY. pp. 19M96.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding TRICH may be ligated 
into an adenovirus transcription/translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to 

25 obtain infective virus which expresses TRICH in host cells. (See, e.g., Logan, J. and T. Shenk 

(1984) Proc. Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the 
Rous sarcoma virus (RS V) enhancer, may be used to increase expression in mammalian host cells. 
SV40 or EBV-based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 

30 DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, JJ. et al. (1997) Nat. Genet. 
15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable expression 
35 of TRICH in cell lines is preferred. For example, sequences encoding TRICH can be transformed 
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into cell lines using expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a separate vector. 
Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days in 
enriched media before being switched to selective media. The purpose of the selectable marker is to 

5 confer resistance to a selective agent, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed cells may be 
propagated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex vims thymidine kinase and adenine 

10 phosphoribosyltransferase genes, for use in tk and apf cells, respectively. (See, e.g., Wigler, M. et 
al. (1977) Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, 
or herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance 
to methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and 
pat confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., 

15 Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) 
J. Mol. Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD, 
which alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan 
(1988) Proc. Natl. Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green 
fluorescent proteins (GFP; Clontech), B glucuronidase and its substrate 6-glucuronide, or luciferase 

20 and its substrate luciferin may be used. These markers can be used not only to identify 

transformants, but also to quantify the amount of transient or stable protein expression attributable 
to a specific vector system. (See, e.g., Rhodes, C.A. (1995) Methods Mol. Biol. 55: 121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest 
is also present, the presence and expression of the gene may need to be confirmed. For example, if 

25 the sequence encoding TRICH is inserted within a marker gene sequence, transformed cells 

containing sequences encoding TRICH can be identified by the absence of marker gene function. 
Alternatively, a marker gene can be placed in tandem with a sequence encoding TRICH under the 
control of a single promoter. Expression of the marker gene in response to induction or selection 
usually indicates expression of the tandem gene as well. 

30 In general, host cells that contain the nucleic acid sequence encoding TRICH and that 

express TRICH may be identified by a variety of procedures known to those of skill in the art. 
These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, 
or chip based technologies for the detection and/or quantification of nucleic acid or protein 

35 sequences. 
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Immunological methods for detecting and measuring the expression of TRICH using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 

5 monoclonal antibodies reactive to two non-interfering epitopes on TRICH is preferred, but a 
competitive binding assay may be employed. These and other assays are well known in the art. 
(See, e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual . APS Press, St. Paul 
MN, Sect. IV; Coligan, J.E. et al. (1997) Current Protocols in Immunology . Greene Pub. Associates 
and Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunochemical Protocols . 

10 Humana Press, Totowa NL) 

A wide variety of labels and conjugation techniques are known by those skilled in the art 
and may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding TRICH 
include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled 

15 nucleotide. Alternatively, the sequences encoding TRICH, or any fragments thereof, may be cloned 
into a vector for the production of an mRNA probe. Such vectors are known in the art, are 
commercially available, and may be used to synthesize RNA probes in vitro by addition of an 
appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures 
may be conducted using a variety of commercially available kits, such as those provided by 

20 Amersham Pharmacia Biotech, Promega (Madison WI), and US Biochemical. Suitable reporter 
molecules or labels which may be used for ease of detection include radionuclides, enzymes, 
fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors, 
magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding TRICH may be cultured under 

25 conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intracellularly depending on the 
sequence and/or the vector used. As will be understood by those of skill in the art, expression 
vectors containing polynucleotides which encode TRICH may be designed to contain signal 
sequences which direct secretion of TRICH through a prokaryotic or eukaryotic cell membrane. 

30 In addition, a host cell strain may be chosen for its ability to modulate expression of the 

inserted sequences or to process the expressed protein in the desired fashion. Such modifications of 
the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, 
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" 
or "pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. 

35 Different host cells which have specific cellular machinery and characteristic mechanisms for 
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post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the 
American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modification and processing of die foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 

5 sequences encoding TRICH may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of the aforementioned host systems. For example, a chimeric TRICH protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of TRICH activity. Heterologous protein 
and peptide moieties may also facilitate purification of fusion proteins using commercially available 

10 affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), 
maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His t 
FLAG, c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of 
their cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, 
and metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable 

15 immunoaffinity purification of fusion proteins using commercially available monoclonal and 
polyclonal antibodies that specifically recognize these epitope tags. A fusion protein may also be 
engineered to contain a proteolytic cleavage site located between the TRICH encoding sequence and 
the heterologous protein sequence, so that TRICH may be cleaved away from the heterologous 
moiety following purification. Methods for fusion protein expression and purification are discussed 

20 in Ausubel (1995, supra , ch. 10). A variety of commercially available kits may also be used to 
facilitate expression and purification of fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled TRICH may be achieved 
in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These 
systems couple transcription and translation of protein-coding sequences operably associated with 

25 the T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid 
precursor, for example, 35 S-methionine. 

TRICH of the present invention or fragments thereof may be used to screen for compounds 
that specifically bind to TRICH. At least one and up to a plurality of test compounds may be 
screened for specific binding to TRICH. Examples of test compounds include antibodies, 

30 oligonucleotides, proteins (e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 
TRICH, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or 
a natural binding partner. (See, e.g., Coligan, IE. et al. (1991) Current Protocols in Immunology 
l(2):_Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which 

35 TRICH binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, 
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the compound can be rationally designed using known techniques. In one embodiment, screening 
for these compounds involves producing appropriate cells which express TRICH, either as a 
secreted protein or on the cell membrane. Preferred cells include cells from mammals, yeast, 
Drosophila . or E. coli . Cells expressing TRICH or cell membrane fractions which contain TRICH 



5 are then contacted with a test compound and binding, stimulation, or inhibition of activity of either 
TRICH or the compound is analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 
the assay may comprise the steps of combining at least one test compound with TRICH. either in 

10 solution or affixed to a solid support, and detecting the binding of TRICH to the compound. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 
labeled competitor. Additionally, the assay may be carried out using cell-free preparations, 
chemical libraries, or natural product mixtures, and the test compound(s) may be free in solution or 
affixed to a solid support. 

15 TRICH of the present invention or fragments thereof may be used to screen for compounds 

that modulate the activity of TRICH. Such compounds may include agonists, antagonists, or partial 
or inverse agonists. In one embodiment, an assay is performed under conditions permissive for 
TRICH activity, wherein TRICH is combined with at least one test compound, and the activity of 
TRICH in the presence of a test compound is compared with the activity of TRICH in the absence 

20 of the test compound. A change in the activity of TRICH in the presence of the test compound is 
indicative of a compound that modulates the activity of TRICH. Alternatively, a test compound is 
combined with an in vitro or cell-free system comprising TRICH under conditions suitable for 
TRICH activity, and the assay is performed. In either of these assays, a test compound which 
modulates the activity of TRICH may do so indirectly and need not come in direct contact with the 

25 test compound. At least one and up to a plurality of test compounds may be screened. 

In another embodiment, polynucleotides encoding TRICH or their mammalian homologs 
may be "knocked out" in an animal model system using homologous recombination in embryonic 
stem (ES) cells. Such techniques are well known in the art and are useful for the generation of 
animal models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 

30 5,767,337.) For example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the 
early mouse embryo and grown in culture. TheES cells are transformed with a vector containing 
the gene of interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; 
Capecchi, M.R. (1989) Science 244:1288-1292). The vector integrates into the corresponding 
region of the host genome by homologous recombination. Alternatively, homologous 

35 recombination takes place using the Cre-loxP system to knockout a gene of interest in a tissue- or 
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developmental stage-specific manner (Marth, J.D. (1996) Clin. Invest. 97: 1999-2002; Wagner, K.U. 
et al. (1997) Nucleic Acids Res. 25:4323-4330). Transformed ES cells are identified and 
microinjected into mouse cell blastocysts such as those from the C57BL/6 mouse strain. The 
blastocysts are surgically transferred to pseudopregnant dams, and the resulting chimeric progeny 
5 are genotyped and bred to produce heterozygous or homozygous strains. Transgenic animals thus 
generated may be tested with potential therapeutic or toxic agents. 

Polynucleotides encoding TRICH may also be manipulated in vitro in ES cells derived from 
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate 
cell lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages 

10 differentiate into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, 
J.A. et al. (1998) Science 282:1145-1147). 

Polynucleotides encoding TRICH can also be used to create "knockin" humanized animals 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a 
region of a polynucleotide encoding TRICH is injected into animal ES cells, and the injected 

15 sequence integrates into the animal cell genome. Transformed cells are injected into blastulae, and 
the blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 
treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 
Alternatively, a mammal inbred to overexpress TRICH, e.g., by secreting TRICH in its milk, may 
also serve as a convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 

20 4:55-74), 

THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of TRICH and transporters and ion channels. In addition, examples of tissues 
expressing TRICH can be found in Table 6. Therefore, TRICH appears to play a role in transport, 
25 neurological, muscle, immunological and cell proliferative disorders. In the treatment of disorders 

associated with increased TRICH expression or activity, it is desirable to decrease the expression or 

i 

activity of TRICH. In the treatment of disorders associated with decreased TRICH expression or 
activity, it is desirable to increase the expression or activity of TRICH. 

Therefore, in one embodiment, TRICH or a fragment or derivative thereof may be 

30 administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of TRICH. Examples of such disorders include, but are not limited to, a transport disorder 
such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's 
muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes 
insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, 

35 normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug 
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resistance, myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, 
peripheral neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with 
transport, e.g., angina, bradyarrythmia, tachyarrythmia, hypertension, Long QT syndrome, 
myocarditis, cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy, 

5 mitochondrial myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body 
myositis, infectious myositis, polymyositis, neurological disorders associated with transport, e.g., 
Alzheimer's disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, 
paranoid psychoses, and schizophrenia, and other disorders associated with transport, e.g., 
neurofibromatosis, postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia, 

10 Wilson's disease, cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, 
hyperglycemia, hypoglycemia, Grave's disease, goiter, Cushing's disease, Addison's disease, 
glucose-galactose malabsorption syndrome, glycogen storage disease, hypercholesterolemia, 
adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn syndrome, von Gierke 
disease, pseudohypoaldosteronism type 1, Liddle's syndrome, cystinuria, iminoglycinuria, Hartup 

15 disease, Fanconi disease, and Bartter syndrome; a neurological disorder such as epilepsy, ischemic 
cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, 
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular 
atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 

20 diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 
suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 
disease, prion diseases including kuru, Creutzfeldt- Jakob disease, and Gerstmann- 
Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 

25 encephalotrigeminal syndrome, mental retardation and other developmental disorders of the centra] 
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 
nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and 
other neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and 
polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic 

30 paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, seasonal affective 
disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, hemiplegic migraine, tardive 
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive 
supranuclear palsy, corticobasal degeneration, and familial frontotemporal dementia; a muscle 
disorder such as cardiomyopathy, myocarditis, Duchenne's muscular dystrophy, Becker's muscular 

35 dystrophy, myotonic dystrophy, central core disease, nemaline myopathy, centronuclear myopathy, 
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lipid myopathy, mitochondrial myopathy, infectious myositis, polymyositis, dermatomyositis, 
inclusion body myositis, thyrotoxic myopathy, ethanol myopathy, angina, anaphylactic shock, 
arrhythmias, asthma, cardiovascular shock, Cushing's syndrome, hypertension, hypoglycemia, 
myocardial infarction, migraine, pheochromocytoma, and myopathies including encephalopathy, 

5 epilepsy, Keams-Sayre syndrome, lactic acidosis, myoclonic disorder, ophthalmoplegia, acid 
maltase deficiency (AMD, also known as Pompe's disease), generalized myotonia, and myotonia 
congenita; an immunological disorder such as acquired immunodeficiency syndrome (AIDS), 
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, 
amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune 

10 thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), 
bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, 
diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis 
fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, 
Graves' disease, Hashimoto's thyroiditis, bypereosinophilia, irritable bowel syndrome, multiple 

15 sclerosis, myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, 
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, 
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, 
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and 

20 helminthic infections, and trauma; and a cell proliferative disorder such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 
(MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, 
primary thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, 

25 bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, 
lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, 
thymus, thyroid, and uterus. 

In another embodiment, a vector capable of expressing TRICH or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 

30 expression or activity of TRICH including, but not limited to, those described above. 

In a further embodiment, a composition comprising a substantially purified TRICH in 
conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or 
prevent a disorder associated with decreased expression or activity of TRICH including, but not 
limited to, those provided above. 

35 In still another embodiment, an agonist which modulates the activity of TRICH may be 
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administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of TRICH including, but not limited to, those listed above. 

In a further embodiment, an antagonist of TRICH may be administered to a subject to treat 
or prevent a disorder associated with increased expression or activity of TRICH. Examples of such 

5 disorders include, but are not limited to, those transport, neurological, muscle, immunological and 
cell proliferative disorders described above. In one aspect, an antibody which specifically binds 
TRICH may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for 
bringing a pharmaceutical agent to cells or tissues which express TRICH. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 

10 encoding TRICH may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of TRICH including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be administered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made 

15 by one of ordinary skill in the art, according to conventional pharmaceutical principles. The 

combination of therapeutic agents may act synergistically to effect the treatment or prevention of the 
various disorders described above. Using this approach, one may be able to achieve therapeutic 
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of TRICH may be produced using methods which are generally known in the 

20 art. In particular, purified TRICH may be used to produce antibodies or to screen libraries of 

pharmaceutical agents to identify those which specifically bind TRICH. ' Antibodies to TRICH may 
also be generated using methods that are well known in the art. Such antibodies may include, but 
are not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, 
and fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which 

25 inhibit dimer formation) are generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with TRICH or with any fragment or oligopeptide 
thereof which has immunogenic properties. Depending on the host species, various adjuvants may 
be used to increase immunological response. Such adjuvants include, but are not limited to, 

30 Freund's, mineral gels such as aluminum hydroxide, and surface active substances such as 

lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among 
adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corvnebacterium parvum are 
especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
35 TRICH have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
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consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 
fragments are identical to a portion of the amino acid sequence of the natural protein, Short 
stretches of TRICH amino acids may be fused with those of another protein, such as KLH, and 
antibodies to the chimeric molecule may be produced. 

5 Monoclonal antibodies to TRICH may be prepared using any technique which provides for 

the production of antibody molecules by continuous cell lines in culture. These include, but are not 
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV- 
hybridoma technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. 
(1985) J. Immunol, Methods 81:31-42; Cote, RJ. et al. (1983) Proc. Natl. Acad. Sci. USA 

10 80:2026-2030; and Cole, S.P. et al. (1984) Mol. Cell Biol. 62: 109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and 

15 Takeda, S. et al. (1985) Nature 3 14:452-454.) Alternatively, techniques described for the 

production of single chain antibodies may be adapted, using methods known in the art, to produce 
TRICH-specific single chain antibodies. Antibodies with related specificity, but of distinct 
idiotypic composition, may be generated by chain shuffling from random combinatorial 
immunoglobulin libraries. (See, e.g., Burton, D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134- 

20 10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding reagents as 
disclosed in the literature. (See, e.g., Orlandi, R. et al (1989) Proc. Natl. Acad. Sci. USA 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

25 Antibody fragments which contain specific binding sites for TRICH may also be generated. 

For example, such fragments include, but are not limited to, F(ab') 2 fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. 

30 et al. (1989) Science 246: 1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities axe well known in the art. Such 
immunoassays typically involve the measurement of complex formation between TRICH and its 

35 specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
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reactive to two non-interfering TRICH epitopes is generally used, but a competitive binding assay 
may also be employed (Pound, supra ). 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay 
techniques may be used to assess the affinity of antibodies for TRICH. Affinity is expressed as an 

5 association constant, K,, which is defined as the molar concentration of TRICH-antibody complex 
divided by the molar concentrations of free antigen and free antibody under equilibrium conditions. 
The K a determined for a preparation of polyclonal antibodies, which are heterogeneous in their 
affinities for multiple TRICH epitopes, represents the average affinity, or avidity, of the antibodies 
for TRICH. The JL, determined for a preparation of monoclonal antibodies, which are monospecific 

10 for a particular TRICH epitope, represents a true measure of affinity. High-affinity antibody 
preparations with K n ranging from about 10 9 to 10 12 L/mole are preferred for use in immunoassays 
in which the TRICH-antibody complex must withstand rigorous manipulations. Low-affinity 
antibody preparations with ranging from about 10 6 to 10 7 L/mole are preferred for use in 
immunopurification and similar procedures which ultimately require dissociation of TRICH, 

15 preferably in active form, from the antibody (Catty, D. (1988) Antibodies. Volume I: A Practical 
A pproach . IRL Press, Washington DC; Liddell, J.E. and A. Cryer (1991) A Practical Guide to 
Monoclonal Antibodies . John Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to 
determine the quality and suitability of such preparations for certain downstream applications. For 

20 example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5-10 mg specific antibody/ml, is generally employed in procedures requiring 
precipitation of TRICH-antibody complexes. Procedures for evaluating antibody specificity, titer, 
and avidity, and guidelines for antibody quality and usage in various applications, are generally 
available. (See, e.g., Catty, supra , and Coligan et al. supra .) 

25 In another embodiment of the invention, the polynucleotides encoding TRICH, or any 

fragment or complement thereof, may be used for therapeutic purposes. In one aspect, 
modifications of gene expression can be achieved by designing complementary sequences or 
antisense molecules (DNA, RNA, PNA, or modified oligonucleotides) to the coding or regulatory 
regions of the gene encoding TRICH. Such technology is well known in the art, and antisense 

30 oligonucleotides or larger fragments can be designed from various locations along the coding or 
control regions of sequences encoding TRICH. (See, e.g., Agrawal, S., ed. (1996) Antisense 
Therapeutics . Humana Press Inc., Totawa NJ.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 

35 intracellular^ in the form of an expression plasmid which, upon transcription, produces a sequence 
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complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, KJ. et al. (1995) 
9(13): 1288-1296.) Antisense sequences can also be introduced intracellular^ through the use of 
viral vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) 

5 Blood 76:271; Ausubel, supra : Uckert, W. and W. Walther (1994) Pharmacol. Then 63(3):323- 
347.) Other gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, 
and other systems known in the art. (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 51(l):217-225; 
Boado. R.J. et al. (1998) J. Pharm. Sci. 87(11):1308-1315; and Morris, M.C. et al. (1997) Nucleic 
Acids Res. 25(14):2730-2736.) 

10 In another embodiment of the invention, polynucleotides encoding TRICH may be used for 

somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic 
deficiency (e.g., in the cases of severe combined immunodeficiency (SCID)-Xl disease 
characterized by X-linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), 
severe combined immunodeficiency syndrome associated with an inherited adenosine deaminase 

15 (ADA) deficiency (Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) 
Science 270:470-475), cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. 
(1995) Hum. Gene Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), 
thalassamias, familial hypercholesterolemia, and hemophilia resulting from Factor VDI or Factor DC 
deficiencies (Crystal, R.G. (1995) Science 270:404^10; Verma, LM. and N. Somia (1997) Nature 

20 389:239-242)), (ii) express a conditionally lethal gene product (e.g., in the case of cancers which 
result from unregulated cell proliferation), or (iii) express a protein which affords protection against 
intracellular parasites (e.g., against human retroviruses, such as human immunodeficiency virus 
(HIV) (Baltimore, D. (1988) Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. 
USA 93: 1 1395-1 1399), hepatitis B or C virus (HB V, HCV); fungal parasites, such as Candida 

25 albicans and Paracoccidioides brasiliensis ; and protozoan parasites such as Plasmodium falciparum 
and Trypanosoma cruzr) . In the case where a genetic deficiency in TRICH expression or regulation 
causes disease, the expression of TRICH from an appropriate population of transduced cells may 
alleviate the clinical manifestations caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 

30 TRICH are treated by constructing mammalian expression vectors encoding TRICH and introducing 
these vectors by mechanical means into TRICH-deficient cells. Mechanical transfer technologies 
for use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) 
ballistic gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene 
transfer, and (v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. 

35 Biochem. 62:191-217; Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. andH. R6cipon (1998) Curr. 
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Opin. Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of TRICH include, but are not 
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-TOPOTA vectors 
(Invitrogen, Carlsbad CA), PCMV-SCRDPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla 

5 CA), and PTET-OFF, PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). 
TRICH may be expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus 
(CMV), Rous sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or P-actin genes), (ii) an 
inducible promoter (e.g., the tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) 
Proc. Natl. Acad. Sci. USA 89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, 

10 F.M.V. and H.M. Blau (1998) Curr. Opin. Biotechnol. 9:451-456), commercially available in the 
T-REX plasmid (Invitrogen)); the ecdysone-inducible promoter (available in the plasmids PVGRXR 
and PIND; Invitrogen); the FK506/rapamycin inducible promoter; or the RU486/mifepristone 
inducible promoter (Rossi, F.M.V. and H.M. Blau, supra) ), or (iii) a tissue-specific promoter or the 
native promoter of the endogenous gene encoding TRICH from a normal individual. 

15 Commercially available liposome transformation kits (e.g., the PERFECT LIPID 

TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 

20 (1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of 
these standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects 
with respect to TRICH expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding TRICH under the control of an independent promoter or the retrovirus long 

25 terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovirus cw-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, I. et al (1995) Proc. 
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in 

30 an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, 
R. et al. (1998) J. Virol. 72:9873-9880). U.S. Patent No. 5,910,434 to Rigg ("Method for obtaining 

35 retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") 
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discloses a method for obtaining retrovirus packaging cell lines and is hereby incorporated by 
reference. Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4* T- 
cells), and the return of transduced cells to a patient are procedures well known to persons skilled in 
the art of gene therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020- 
5 7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; 
Ranga, U. et al (1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283- 
2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
polynucleotides encoding TRICH to cells which have one or more genetic abnormalities with 

10 respect to the expression of TRICH. The construction and packaging of adenovirus-based vectors 
are well known to those with ordinary skill in the art. Replication defective adenovirus vectors have 
proven to be versatile for importing genes encoding immunoregulatory proteins into intact islets in 
the pancreas (Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral 
vectors are described in U.S. Patent No. 5,707,618 to Armentano ("Adenovirus vectors for gene 

15 therapy"), hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P.A. et al. 
(1999) Annu. Rev. Nutr. 19:511-544 and Verma, LM. and N. Somia (1997) Nature 18:389:239-242, 
both incorporated by reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding TRICH to target cells which have one or more genetic abnormalities with 

20 respect to the expression of TRICH. The use of herpes simplex virus (HSV)-based vectors may be 
especially valuable for introducing TRICH to cells of the central nervous system, for which HSV 
has a tropism. The construction and packaging of herpes-based vectors are well known to those 
with ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1 -based 
vector has been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. 

25 Eye Res. 169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail 
in U.S. Patent No. 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is 
hereby incorporated by reference. U.S. Patent No. 5,804,413 teaches the use of recombinant HSV 
d92 which consists of a genome containing at least one exogenous gene to be transferred to a cell 
under the control of the appropriate promoter for purposes including human gene therapy. Also 

30 taught by this patent are the construction and use of recombinant HSV strains deleted for ICP4, 
ICP27 and ICP22. For HSV vectors, see also Goins, W.F. et al. (1999) J. Virol. 73:519-532 and Xu, 
H. et al. (1994) Dev. Biol. 163:152-161, hereby incorporated by reference. The manipulation of 
cloned herpesvirus sequences, the generation of recombinant virus following the transfection of 
multiple plasmids containing different segments of the large herpesvirus genomes, the growth and 

35 propagation of herpesvirus, and the infection of cells with herpesvirus are techniques well known to 
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those of ordinary skill in the art. 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding TRICH to target cells. The biology of the prototypic alphavirus, 
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 

5 on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the full length genomic RNA, 
resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic 
activity (e.g., protease and polymerase). Similarly, inserting the coding sequence for TRICH into 

10 the alphavirus genome in place of the capsid-coding region results in the production of a large 
number of TRICH-coding RNAs and the synthesis of high levels of TRICH in vector transduced 
cells. While alphavirus infection is typically associated with cell lysis within a few days, the ability 
to establish a persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis 
virus (SIN) indicates that the lytic replication of alphaviruses can be altered to suit the needs of the 

15 gene therapy application (Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of 
alphaviruses will allow the introduction of TRICH into a variety of cell types. The specific 
transduction of a subset of cells in a population may require the sorting of cells prior to 
transduction. The methods of manipulating infectious cDNA clones of alphaviruses, performing 
alphavirus cDNA and RNA transfections, and performing alphavirus infections, are well known to 

20 those with ordinary skill in the art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions 
-10 and +10 from the start site, may also be employed to inhibit gene expression. Similarly, 
inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing is 
useful because it causes inhibition of the ability of the double helix to open sufficiently for the 

25 binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances 
using triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, 
B.E. and B.L Carr, Molecular and Immunologic Approaches . Futura Publishing, Mt. Kisco NY, pp. 
163-177.) A complementary sequence or antisense molecule may also be designed to block 
translation of mRNA by preventing the transcript from binding to ribosomes. 

30 Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage 

of RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the 
ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage. For 
example, engineered hammerhead motif ribozyme molecules may specifically and efficiently 
catalyze endonucleolytic cleavage of sequences encoding TRICH. 

35 Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
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scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU/and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 

5 candidate targets may also be evaluated by testing accessibility to hybridization with 
complementary oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared 
by any method known in the art for the synthesis of nucleic acid molecules. These include 
techniques for chemically synthesizing oligonucleotides such as solid phase phosphoramidite 

10 chemical synthesis. Alternatively, RNA molecules may be generated by in vitro and in vivo 

transcription of DNA sequences encoding TRICH. Such DNA sequences may be incorporated into 
a wide variety of vectors with suitable RNA polymerase promoters such as T7 or SP6. 
Alternatively, these cDNA constructs that synthesize complementary RNA, constitutively or 
inducibly, can be introduced into cell lines, cells, or tissues. 

15 RNA molecules may be modified to increase intracellular stability and half-life. Possible 

modifications include, but are not limited to, the addition of flanking sequences at the 5* and/or 3' 
ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiesterase 
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
and can be extended in all of these molecules by the inclusion of nontraditionai bases such as 

20 inosine, queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms 
of adenine, cytidine, guanine, thymine, and uridine which are not as easily recognized by 
endogenous endonucleases. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding TRICH. 

25 Compounds which may be effective in altering expression of a specific polynucleotide may include, 
but are not limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming 
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and non- 
macromolecular chemical entities which are capable of interacting with specific polynucleotide 
sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors 

30 or promoters of polynucleotide expression. Thus, in the treatment of disorders associated with 
increased TRICH expression or activity, a compound which specifically inhibits expression of the 
polynucleotide encoding TRICH may be therapeutically useful, and in the treatment of disorders 
associated with decreased TRICH expression or activity, a compound which specifically promotes 
expression of the polynucleotide encoding TRICH may be therapeutically useful. 

35 At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
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altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective 
in altering polynucleotide expression; selection from an existing, commercially-available or 
proprietary library of naturally-occurring or non-natural chemical compounds; rational design of a 

5 compound based on chemical and/or structural properties of the target polynucleotide; and selection 
from a library of chemical compounds created combinatorially or randomly. A sample comprising 
a polynucleotide encoding TRICH is exposed to at least one test compound thus obtained. The 
sample may comprise, foT example, an intact or permeabilized cell, or an in vitro cell-free or 
reconstituted biochemical system. Alterations in the expression of a polynucleotide encoding 

10 TRICH are assayed by any method commonly known in the art. Typically, the expression of a 
specific nucleotide is detected by hybridization with a probe having a nucleotide sequence 
complementary to the sequence of the polynucleotide encoding TRICH. The amount of 
hybridization may be quantified, thus forming the basis for a comparison of the expression of the 
polynucleotide both with and without exposure to one or more test compounds. Detection of a 

15 change in the expression of a polynucleotide exposed to a test compound indicates that the test 
compound is effective in altering the expression of the polynucleotide. A screen for a compound 
effective in altering expression of a specific polynucleotide can be carried out, for example, using a 
Schizosaccharomvces pombe gene expression system (Atkins, D. et al. (1999) U.S. Patent No. 
5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids Res. 28:E15) or a human cell line such as HeLa 

20 cell (Clarke, M.L. et al. (2000) Biochem. Biophys. Res. Commun. 268:8-13). A particular 

embodiment of the present invention involves screening a combinatorial library of oligonucleotides 
(such as deoxyribonucleotides, ribonucleotides, peptide nucleic acids, and modified 
oligonucleotides) for antisense activity against a specific polynucleotide sequence (Bruice, T.W. et 
al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. Patent No. 6,022,691). 

25 Many methods for introducing vectors into cells or tissues are available and equally suitable 

for use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be 
achieved using methods which are well known in the art (See, e.g., Goldman, C.K. et al. (1997) 

30 Nat. Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, 
and monkeys. 

An additional embodiment of the invention relates to the administration of a composition 
35 which generally comprises an active ingredient formulated with a pharmaceutically acceptable 
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excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. 
Various formulations are commonly known and are thoroughly discussed in the latest edition of 
Remington's Pharmaceutical Sciences (Maack Publishing, Easton PA). Such compositions may 
consist of TRICH, antibodies to TRICH, and mimetics, agonists, antagonists, or inhibitors of 
5 TRICH. 

The compositions utilized in this invention may be administered by any number of routes 
including, but not limited to, oral, intravenous, intramuscular, intra-arteriai, intramedullary, 
intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. 

10 Compositions for pulmonary administration may be prepared in liquid or dry powder form. 

These compositions are generally aerosolized immediately prior to inhalation by the patient. In the 
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of 
fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. larger 
peptides and proteins), recent developments in the field of pulmonary delivery via the alveolar 

15 region of the lung have enabled the practical delivery of drugs such as insulin to blood circulation 
(see, e.g., Pattoh, J.S. et al., U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of 
administration without needle injection, and obviates the need for potentially toxic penetration 
enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 
20 ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

Specialized forms of compositions may be prepared for direct intracellular delivery of 
macromolecules comprising TRICH or fragments thereof. For example, liposome preparations 
containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of 
25 the macromolecule. Alternatively, TRICH or a fragment thereof may be joined to a short cationic 
N-terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found to 
transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. 
et al. ( 1999) Science 285: 1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
30 culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, 
monkeys, or pigs. An animal model may also be used to determine the appropriate concentration 
range and route of administration. Such information can then be used to determine useful doses and 
routes for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
35 TRICH or fragments thereof, antibodies of TRICH, and agonists, antagonists or inhibitors of 
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TRICH, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be 
determined by standard pharmaceutical procedures in cell cultures or with experimental animals, 
such as by calculating the ED 50 (the dose therapeutically effective in 50% of the population) or LD 50 
(the dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is 

5 the therapeutic index, which can be expressed as the LD 5( /ED 50 ratio. Compositions which exhibit 
large therapeutic indices are preferred. The data obtained from cell culture assays and animal 
studies are used to formulate a range of dosage for human use. The dosage contained in such 
compositions is preferably within a range of circulating concentrations that includes the ED S0 with 
little or no toxicity. The dosage varies within this range depending upon the dosage form employed, 

10 the sensitivity of the patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of 
the active moiety or to maintain the desired effect. Factors which may be taken into account include 
the severity of the disease state, the general health of the subject, the age, weight, and gender of the 

15 subject, time and frequency of administration, drug combination(s), reaction sensitivities, and 

response to therapy. Long-acting compositions may be administered every 3 to 4 days, every week, 
or biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0. 1 Mg to 100,000 /ig, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 

20 methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

25 In another embodiment, antibodies which specifically bind TRICH may be used for the 

diagnosis of disorders characterized by expression of TRICH, or in assays to monitor patients being 
treated with TRICH or agonists, antagonists, or inhibitors of TRICH. Antibodies useful for 
diagnostic purposes may be prepared in the same manner as described above for therapeutics. 
Diagnostic assays for TRICH include methods which utilize the antibody and a label to detect 

30 TRICH in human body fluids or in extracts of ceils or tissues. The antibodies may be used with or 
without modification, and may be labeled by covalent or non-covalent attachment of a reporter 
molecule. A wide variety of reporter molecules, several of which are described above, are known in 
the art and may be used. 

A variety of protocols for measuring TRICH, including ELISAs, RIAs, and FACS, are 

35 known in the art and provide a basis for diagnosing altered or abnormal levels of TRICH expression. 
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Normal or standard values for TRICH expression are established by combining body fluids or cell 
extracts taken from normal mammalian subjects, for example, human subjects, with antibodies to 
TRICH under conditions suitable for complex formation. The amount of standard complex 
formation may be quantitated by various methods, such as photometric means. Quantities of 
5 TRICH expressed in subject, control, and disease samples from biopsied tissues are compared with 
the standard values. Deviation between standard and subject values establishes the parameters for 
diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding TRICH may be used 
for diagnostic purposes. The polynucleotides which may be used include oligonucleotide 
10 sequences, complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used 
to detect and quantify gene expression in biopsied tissues in which expression of TRICH may be 
correlated with disease. The diagnostic assay may be used to determine absence, presence, and 
excess expression of TRICH, and to monitor regulation of TRICH levels during therapeutic 
intervention. 

15 In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 

sequences, including genomic sequences, encoding TRICH or closely related molecules may be 
used to identify nucleic acid sequences which encode TRICH. The specificity of the probe, whether 
it is made from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, 
e.g., a conserved motif, and the stringency of the hybridization or amplification will determine 

20 whether the probe identifies only naturally occurring sequences encoding TRICH, allelic variants, 
or related sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the TRICH encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:33-64 or 
25 from genomic sequences including promoters, enhancers, and introns of the TRICH gene. 

Means for producing specific hybridization probes for DNAs encoding TRICH include the 
cloning of polynucleotide sequences encoding TRICH or TRICH derivatives into vectors for the 
production of mRNA probes. Such vectors are known* in the art, are commercially available, and 
may be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
30 polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding TRICH may be used for the diagnosis of disorders 
associated with expression of TRICH. Examples of such disorders include, but are not limited to, a 
35 transport disorder such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic 
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fibrosis, Becker* s muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, 
diabetes insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic 
paralysis, normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug 
resistance, myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, 

5 peripheral neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with 
transport, e.g., angina, bradyanythmia, tachyarrythmia, hypertension, Long QT syndrome, 
myocarditis, cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy, 
mitochondrial myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body 
myositis, infectious myositis, polymyositis, neurological disorders associated with transport, e.g., 

10 Alzheimer's disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, 
paranoid psychoses, and schizophrenia, and other disorders associated with transport, e.g., x 
neurofibromatosis, postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia, 
Wilson's disease, cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, 
hyperglycemia, hypoglycemia, Grave's disease, goiter, Cushing's disease, Addison's disease, 

15 glucose-galactose malabsorption syndrome, glycogen storage disease, hypercholesterolemia, 

adrenoieukodystrophy, Zellweger syndrome, Menkes disease, occipital horn syndrome, von Gierke 
disease, pseudohypoaldosteronism type 1, Liddle's syndrome, cystinuria, iminoglycinuria, Hartup 
disease, Fanconi disease, and Bartter syndrome; a neurological disorder such as epilepsy, ischemic 
cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, 

20 Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular 
atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 
suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 

25 disease, prion diseases including kuru, Creutzfeldt- Jakob disease, and Gerstmann- 

Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 

30 nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and 
other neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and 
polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic 
paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, seasonal affective 
disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, hemiplegic migraine, tardive 

35 dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive 
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supranuclear palsy, corticobasal degeneration, and familial frontotemporal dementia; a muscle 
disorder such as cardiomyopathy, myocarditis, Duchenne's muscular dystrophy, Becker's muscular 
dystrophy, myotonic dystrophy, central core disease, nemaline myopathy, centronuclear myopathy, 
lipid myopathy, mitochondrial myopathy, infectious myositis, polymyositis, dermatomyositis, 

5 inclusion body myositis, thyrotoxic myopathy, ethanol myopathy, angina, anaphylactic shock, 
arrhythmias, asthma, cardiovascular shock, Cushing's syndrome, hypertension, hypoglycemia, 
myocardial infarction, migraine, pheochromocytoraa, and myopathies including encephalopathy, 
epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic disorder, ophthalmoplegia, acid 
maltase deficiency (AMD, also known as Pompe's disease), generalized myotonia, and myotonia 

10 congenita; an immunological disorder such as acquired immunodeficiency syndrome (AIDS), 
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, 
amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune 
thyroiditis, autoimmune polyendocrinopatliy-candidiasis-ectodermal dystrophy (APECED), 
bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, 

15 diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis 
fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, 
Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple 
sclerosis, myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, 
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 

20 Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, 
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, 
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and 
helminthic infections, and trauma; and a cell proliferative disorder such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 

25 (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, 

primary thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, 
bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, 
lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, 

30 thymus, thyroid, and uterus. The polynucleotide sequences encoding TRICH may be used in 
Southern or northern analysis, dot blot, or other membrane-based technologies; in PCR 
technologies; in dipstick, pin, and multiformat ELISA-like assays; and in microarrays utilizing 
fluids or tissues from patients to detect altered TRICH expression. Such qualitative or quantitative 
methods are well known in the art. 

35 In a particular aspect, the nucleotide sequences encoding TRICH may be useful in assays 
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that detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding TRICH may be labeled by standard methods and added to a fluid or tissue 
sample from a patient under conditions suitable for the formation of hybridization complexes. After 
a suitable incubation period, the sample is washed and the signal is quantified and compared with a 

5 standard value. If the amount of signal in the patient sample is significantly altered in comparison 
to a control sample then the presence of altered levels of nucleotide sequences encoding TRICH in 
the sample-indicates the presence of the associated disorder. Such assays may also be used to 
evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical 
trials, or to monitor the treatment of an individual patient. 

10 In order to provide a basis for the diagnosis of a disorder associated with expression of 

TRICH, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding TRICH, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values obtained from 

15 normal subjects with values from an experiment in which a known amount of a substantially 
purified polynucleotide is used. Standard values obtained in this manner may be compared with 
values obtained from samples from patients who are symptomatic for a disorder. Deviation from 
standard values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 

20 hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained 
from successive assays may be used to show the efficacy of treatment over a period ranging from 
several days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 

25 overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to the 
appearance of actual clinical symptoms. A more definitive diagnosis of this type may allow health 
professionals to employ preventative measures or aggressive treatment earlier thereby preventing 
the development or further progression of the cancer. 

30 Additional diagnostic uses for oligonucleotides designed from the sequences encoding 

TRICH may involve the use of PCR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro . Oligomers will preferably contain a fragment of a 
polynucleotide encoding TRICH, or a fragment of a polynucleotide complementary to the 
polynucleotide encoding TRICH, and will be employed under optimized conditions for 

35 identification of a specific gene or condition. Oligomers may also be employed under less stringent 
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conditions for detection or quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
encoding TRICH may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic 

5 disease in humans. Methods of SNP detection include, but are not limited to, single-stranded 
conformation polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, 
oligonucleotide primers derived from the polynucleotide sequences encoding TRICH are used to 
amplify DNA using the polymerase chain reaction (PCR). The DNA may be derived, for example, 
from diseased or normal tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause 

10 differences in the secondary and tertiary structures of PCR products in single-stranded form, and 
these differences are detectable using gel electrophoresis in non-denaturing gels. In fSCCP, the 
oligonucleotide primers are fluorescently labeled, which allows detection of the amplimers in high- 
throughput equipment such as DNA sequencing machines. Additionally, sequence database 
analysis methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by 

15 comparing the sequence of individual overlapping DNA fragments which assemble into a common 
consensus sequence. These computer-based methods filter out sequence variations due to 
laboratory preparation of DNA and sequencing errors using statistical models and automated 
analyses of DNA sequence chromatograms. In the alternative, SNPs may be detected and 
characterized by mass spectrometry using, for example, the high throughput MASSARRAY system 

20 (Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of TRICH include radiolabeling 
or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. 
et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 

25 accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide 
of interest is presented in various dilutions and a spectrophotometric or colorimetric response gives 
rapid quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as elements on a microarray. The 

30 microarray can be used in transcript imaging techniques which monitor the relative expression 
levels of large numbers of genes simultaneously as described below. The microarray may also be 
used to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to 
monitor progression/regression of disease as a function of gene expression, and to develop and 

35 monitor the activities of therapeutic agents in the treatment of disease. In particular, this 
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information may be used to develop a phannacogenomic profile of a patient in order to select the 
most appropriate and effective treatment regimen for that patient. For example, therapeutic agents 
which are highly effective and display the fewest side effects may be selected for a patient based on 
his/her phannacogenomic profile. 
5 In another embodiment, TRICH, fragments of TRICH, or antibodies specific for TRICH 

may be used as elements on a microarray. The microarray may be used to monitor or measure 
protein-protein interactions, drug-target interactions, and gene expression profiles, as described 
above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 

10 generate a transcript image of a tissue or cell type. A transcript image represents the global pattern 
of gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed 
by quantifying the number of expressed genes and their relative abundance under given conditions 
and at a given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent 
No. 5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be 

15 generated by hybridizing the polynucleotides of the present invention or their complements to the 
totality of transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, 
the hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
invention or their complements comprise a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity. 

20 Transcript images may be generated using transcripts isolated from tissues, cell lines, 

biopsies, or other biological samples. The transcript image may thus reflect gene expression in 
vivo , as in the case of a tissue or biopsy sample, or in vitro , as in the case of a cell line. 

Transcript images which profile the expression of the polynucleotides of the present 
invention may also be used in conjunction with in vitro model systems and preclinical evaluation of 

25 pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental 
compounds, All compounds induce characteristic gene expression patterns, frequently termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and 
toxicity (Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson 
(2000) Toxicol. Lett. 1 12-1 13:467-471, expressly incorporated by reference herein). If a test 

30 compound has a signature similar to that of a compound with known toxicity, it is likely to share 
those toxic properties. These fingerprints or signatures are most useful and refined when they 
contain expression information from a large number of genes and gene families. Ideally, a genome- 
wide measurement of expression provides the highest quality signature. Even genes whose 
expression is not altered by any tested compounds are important as well, as the levels of expression 

35 of these genes are used to normalize the rest of the expression data. The normalization procedure is 
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useful for comparison of expression data after treatment with different compounds. While the 
assignment of gene function to elements of a toxicant signature aids in interpretation of toxicity 
mechanisms, knowledge of gene function is not necessary for the statistical matching of signatures 
which leads to prediction of toxicity. (See, for example, Press Release 00-02 from the National 
5 Institute of Environmental Health Sciences, released February 29, 2000, available at 

http://www.niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is important and desirable in 
toxicological screening using toxicant signatures to include all expressed gene sequences. 

In one embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing nucleic acids with the test compound. Nucleic acids that are expressed in the 
10 treated biological sample are hybridized with one or more probes specific to the polynucleotides of 
the present invention, so that transcript levels corresponding to the polynucleotides of the present 
invention may be quantified. The transcript levels in the treated biological sample are compared 
with levels in an untreated biological sample. Differences in the transcript levels between the two 
samples are indicative of a toxic response caused by the test compound in the treated sample. 
15 Another particular embodiment relates to the use of the polypeptide sequences of the 

present invention to analyze the proteome of a tissue or cell type. The term proteome refers to the 
global pattern of protein expression in a particular tissue or cell type. Each protein component of a 
proteome can be subjected individually to further analysis. Proteome expression patterns, or 
profiles, are analyzed by quantifying the number of expressed proteins and their relative abundance 
20 under given conditions and at a given time. A profile of a cell's proteome may thus be generated by 
separating and analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the 
separation is achieved using two-dimensional gel electrophoresis, in which proteins from a sample 
are separated by isoelectric focusing in the first dimension, and then according to molecular weight 
by sodium dodecyl sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, 
25 supra ). The proteins are visualized in the gel as discrete and uniquely positioned spots, typically by 
staining the gel with an agent such as Coomassie Blue or silver or fluorescent stains. The optical 
density of each protein spot is generally proportional to the level of the protein in the sample. The 
optical densities of equivalently positioned protein spots from different samples, for example, from 
biological samples either treated or untreated" with a test compound or therapeutic agent, are 
30 compared to identify any changes in protein spot density related to the treatment. The proteins in 
the spots are partially sequenced using, for example, standard methods employing chemical or 
enzymatic cleavage followed by mass spectrometry. The identity of the protein in a spot may be 
determined by comparing its partial sequence, preferably of at least 5 contiguous amino acid 
residues, to the polypeptide sequences of the present invention. In some cases, further sequence 
35 data may be obtained for definitive protein identification. 
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A proteomic profile may also be generated using antibodies specific for TRICH to quantify 
the levels of TRICH expression. In one embodiment, the antibodies are used as elements on a 
microarray, and protein expression levels are quantified by exposing the microarray to the sample 
and detecting the levels of protein bound to each array element (Luelang, A. et al. (1999) Anal. 

5 Biochem. 270: 103-1 11; Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be 
performed by a variety of methods known in the art, for example, by reacting the proteins in the 
sample with a thiol- or amino-reactive fluorescent compound and detecting the amount of 
fluorescence bound at each array element. 

Toxicant signatures at the proteome level are also useful for toxicological screening, and 

10 should be analyzed in parallel with toxicant signatures at the transcript level. There is a poor 

correlation between transcript and protein abundances for some proteins in some tissues (Anderson, 
N.L. and J. Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be 
useful in the analysis of compounds which do not significantly affect the transcript image, but 
which alter the proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, 

15 due to rapid degradation of mRNA, so proteomic profiling may be more reliable and informative in 
such cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated 
biological sample are separated so that the amount of each protein can be quantified. The amount of 

20 each protein is compared to the amount of the corresponding protein in an untreated biological 
sample. A difference in the amount of protein between the two samples is indicative of a toxic 
response to the test compound in the treated sample. Individual proteins are identified by 
sequencing the amino acid residues of the individual proteins and comparing these partial sequences 
to the polypeptides of the present invention. 

25 In another embodiment, the toxicity of a test compound is assessed by treating a biological 

sample containing proteins with the test compound. Proteins from the biological sample are 
incubated with antibodies specific to the polypeptides of the present invention. The amount of 
protein recognized by the antibodies is quantified. The amount of protein in the treated biological 
sample is compared with the amount in an untreated biological sample. A difference in the amount 

30 of protein between the two samples is indicative of a toxic response to the test compound in the 
treated sample. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, 
e.g., Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. 
Acad. Sci. USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251116; 
35 Shalon, D. et al. (1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. 
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Sci. USA 94:2150-2155; and Heller, ML et al. (1997) U.S. Patent No. 5,605,662.) Various types of 
mdcroarrays are well known and thoroughly described in DNA Microarravs: A Practical Approach . 
M. Schena, ed. (1999) Oxford University Press, London, hereby expressly incorporated by 
reference. 

5 In another embodiment of the invention, nucleic acid sequences encoding TRICH may be 

used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. 
Either coding or noncoding sequences may be used, and in some instances, noncoding sequences 
may be preferable over coding sequences. For example, conservation of a coding sequence among 
members of a multi-gene family may potentially cause undesired cross hybridization during 

10 chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific 
region of a chromosome, or to artificial chromosome constructions, e.g., human artificial 
chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes 
(BACs), bacterial PI constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, 
LL et al. (1997) Nat. Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.L 

15 (1991) Trends Genet. 7: 149-154.) Once mapped, the nucleic acid sequences of the invention may 
be used to develop genetic linkage maps, for example, which correlate the inheritance of a disease 
state with the inheritance of a particular chromosome region or restriction fragment length 
polymorphism (RFLP). (See, for example, Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. 
Sci. USA 83:7353-7357.) 

20 Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic 

map data. (See, e.g.,'Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 965-968.) Examples of 
genetic map data can be found in various scientific journals or at the Online Mendelian Inheritance 
in Man (OMIM) World Wide Web site. Correlation between the location of the gene encoding 
TRICH on a physical map and a specific disorder, or a predisposition to a specific disorder, may 

25 help define the region of DNA associated with that disorder and thus may further positional cloning 
efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, 

30 may reveal associated markers even if the exact chromosomal locus is not known. This information 
is valuable to investigators searching for disease genes using positional cloning or other gene ' 
discovery techniques. Once the gene or genes responsible for a disease or syndrome have been 
crudely localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 
1 lq22-23, any sequences mapping to that area may represent associated or regulatory genes for 

35 further investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide 
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sequence of the instant invention may also be used to detect differences in the chromosomal 

location due to translocation, inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, TRICH, its catalytic or immunogenic fragments, or 

oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
5 screening techniques. The fragment employed in such screening may be free in solution, affixed to 

a solid support, borne on a cell surface, or located intracellularly. The formation of binding 

complexes between TRICH and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 

having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT 
10 application WO84/03564.) In this method, large numbers of different small test compounds are 

synthesized on a solid substrate. The test compounds are reacted with TRICH, or fragments thereof, 

and washed. Bound TRICH is then detected by methods well known in the art. Purified TRICH can 

also be coated directly onto plates for use in the aforementioned drug screening techniques. 

Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a 
15 solid support. 

In another embodiment, one may use competitive drug screening assays in which 

neutralizing antibodies capable of binding TRICH specifically compete with a test compound for 

binding TRICH. In this manner, antibodies can be used to detect the presence of any peptide which 

shares one or more antigenic determinants with TRICH. 
20 In additional embodiments, the nucleotide sequences which encode TRICH may be used in 

any molecular biology techniques that have yet to be developed, provided the new techniques rely 

on properties of nucleotide sequences that are currently known, including, but not limited to, such 

properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
25 description, utilize the present invention to its fullest extent. The following embodiments are, 

therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure 

in any way whatsoever. 

The disclosures of all patents, applications and publications, mentioned above and below, 

including US. Ser. No. 60/254,303, U.S. Ser. No. 60/256,190, U.S. Ser. No. 60/257,504, U.S. Ser. 
30 No. 60/261,546, U.S. Ser. No. 60/262,832, U.S. Ser. No. 60/264,377, and U.S. Ser. No. 60/266,019, 

are expressly incorporated by reference herein. 

EXAMPLES 

I. Construction of cDNA Libraries 
35 Incyte cDNAs were derived from cDNA libraries described in the LEFESEQ GOLD 
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database (Incyte Genomics, Palo Alto CA). Some tissues were homogenized and lysed in 
guanidinium isothiocyanate, while others were homogenized and lysed in phenol or in a suitable 
mixture of denaturants, such as TR1ZOL (Life Technologies), a monophasic solution of phenol and 
guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or extracted 

5 with chloroform. RNA was precipitated from the lysates with either isopropanol or sodium acetate 
and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was 
isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles 

10 (QIAGEN, Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, 
RNA was isolated directly from tissue lysates using other RNA isolation kits, e.g., the 
POLY(A)PURE mRNA purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding 
cDNA libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the 

15 UNIZAP vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using 
the recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra , 
units 5. 1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with 
the appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected 

20 (300-1000 bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. 
cDNAs were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, 
e.g., PBLUESCRDPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), PCDNA2.1 
plasmid (Invitrogen, Carlsbad CA), PBK-CMV plasmid (Stratagene), PCR2-TOPOTA plasmid 

25 (Invitrogen), PCMV-ICIS plasmid (Stratagene), pIGEN (Incyte Genomics, Palo Alto CA), pRARE 
(Incyte Genomics), or pINCY (Incyte Genomics), or derivatives thereof. Recombinant plasmids 
were transformed into competent E. coli cells including XLl-Blue, XLl-BlueMRF, or SOLR from 
Stratagene or DH5o, DH10B, or ElectroMAX DH10B from Life Technologies. 
EL Isolation of cDNA Clones 

30 Plasmids obtained as described in Example I were recovered from host cells by in vivo 

excision using-the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified 
using at least one of the following: a Magic or WIZARD Minipreps DNA purification system 
(Promega); an AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and 
QIAWELL 8 Plasmid, QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems 

35 or the R.E.AX. PREP 96 plasmid purification kit from QIAGEN. Following precipitation, plasmids 
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were resuspended in 0. 1 ml of distilled water and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 

5 384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically 
using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence 
scanner (Labsystems Oy, Helsinki, Finland). 
III. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 

10 Sequencing reactions were processed using standard methods or high-throughput instrumentation 
such as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal 
cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the 
MICROLAB 2200 (Hamilton) liquid transfer system cDNA sequencing reactions were prepared 
using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such 

15 as the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides 
were carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the 
ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI 
protocols and base calling software; or other sequence analysis systems known in the art. Reading 

20 frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 
1997, supra , unit 7.7). Some of the cDNA sequences were selected for extension using the 
techniques disclosed in Example VIE 

The polynucleotide sequences derived from Incyte cDNAs were validated by removing 
vector, linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and 

25 programs based on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. 
The Incyte cDNA sequences or translations thereof were then queried against a selection of public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, 
and BLOCKS, PRINTS, DOMO, PRODOM; PROTEOME databases with sequences fromiJomQ 
sapiens . Rattus norvegicus . Mus musculus . Caenorhabditis elegans . Saccharomvces cerevisiae. 

30 Snhizosaccharomvces pombe . and Candida albicans (Incyte Genomics, Palo Alto CA); and hidden 
Markov model (HMM)-based protein family databases such as PFAM. (HMM is a probabilistic 
approach which analyzes consensus primary structures of gene families. See, for example, Eddy, 
S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) The queries were performed using programs 
based on BLAST, FASTA, BLIMPS, and HMMER. The Incyte cDNA sequences were assembled 

35 to produce full length polynucleotide sequences. Alternatively, GenBank cDNAs, GenBank ESTs, 
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stitched sequences, stretched sequences, or Genscan-predicted coding sequences (see Examples IV 
and V) were used to extend Incyte cDNA assemblages to full length. Assembly was performed 
using programs based on Phred, Phrap, and Consed, and cDNA assemblages were screened for open 
reading frames using programs based on GeneMark, BLAST, and FASTA. The full length 

5 polynucleotide sequences were translated to derive the corresponding full length polypeptide 
sequences. Alternatively, a polypeptide of the invention may begin at any of the methionine 
residues of the full length translated polypeptide. Full length polypeptide sequences were 
subsequently analyzed by querying against databases such as the GenBank protein databases 
(genpept), SwissProt, the PROTEOME databases, BLOCKS, PRINTS, DOMO, PRODOM, Prosite, 

10 and hidden Markov model (HMM)-based protein family databases such as PFAM. Full length 
polynucleotide sequences are also analyzed using MACDNASIS PRO software (Hitachi Software 
Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide 
and polypeptide sequence alignments are generated using default parameters specified by the 
CLUSTAL algorithm as incorporated into the MEGALIGN multisequence alignment program 

15 (DNASTAR), which also calculates the percent identity between aligned sequences. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly 
of Incyte cDNA and full length sequences and provides applicable descriptions, references, and 
threshold parameters. The first column of Table 7 shows the tools, programs, and algorithms used, 
the second column provides brief descriptions thereof, the third column presents appropriate 

20 references, all of which are incorporated by reference herein in their entirety, and the fourth column 
presents, where applicable, the scores, probability values, and other parameters used to evaluate the 
strength of a match between two sequences (the higher the score or the lower the probability value, 
the greater the identity between two sequences). 

The programs described above for the assembly and analysis of full length polynucleotide 

25 and polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ 
ID NO: 33-64. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization 
and amplification technologies are described in Table 4, column 2. 
IV. Identification and Editing of Coding Sequences from Genomic DNA 

Putative transporters and ion channels were initially identified by running the Genscan gene 

30 identification program against public genomic sequence databases (e.g., gbpri and gbhtg). Genscan 
is a general-purpose gene identification program which analyzes genomic DNA sequences from a 
variety of organisms (See Burge, C. and S. Karlin (1997) J. Mol Biol. 268:78-94, and Burge, C. and 
S. Karlin (1998) Curr. Opin. Struct. Biol. 8:346-354). The program concatenates predicted exons to 
form an assembled cDNA sequence extending from a methionine to a stop codon. The output of 

35 Genscan is a FASTA database of polynucleotide and polypeptide sequences. The maximum range 
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of sequence for Genscan to analyze at once was set to 30 kb. To determine which of these Genscan 
predicted cDNA sequences encode transporters and ion channels, the encoded polypeptides were 
analyzed by querying against PFAM models for transporters and ion channels. Potential 
transporters and ion channels were also identified by homology to Incyte cDNA sequences that had 

5 been annotated as transporters and ion channels. These selected Genscan-predicted sequences were 
then compared by BLAST analysis to the genpept and gbpri public databases. Where necessary, the 
Genscan-predicted sequences were then edited by comparison to the top BLAST hit from genpept to 
correct errors in the sequence predicted by Genscan, such as extra or omitted exons. BLAST 
analysis was also used to find any Incyte cDNA or public cDNA coverage of the Genscan-predicted 

10 sequences, thus providing evidence for transcription. When Incyte cDNA coverage was available, 
this information was used to correct or confirm the Genscan predicted sequence. Full length 
polynucleotide sequences were obtained by assembling Genscan-predicted coding sequences with 
Incyte cDNA sequences and/or public cDNA sequences using the assembly process described in 
Example IE Alternatively, full length polynucleotide sequences were derived entirely from edited 

15 or unedited Genscan-predicted coding sequences. 

V. Assembly of Genomic Sequence Data with cDNA Sequence Data 
"Stitched" Sequences 

Partial cDNA sequences were extended with exons predicted by the Genscan gene 
identification program described in Example IV. Partial cDNAs assembled as described in Example 

20 m were mapped to genomic DNA and parsed into clusters containing related cDNAs and Genscan 
exon predictions from one or more genomic sequences. Each cluster was analyzed using an 
algorithm based on graph theory and dynamic programming to integrate cDNA and genomic 
information, generating possible splice variants that were subsequently confirmed, edited, or 
extended to create a full length sequence. Sequence intervals in which the entire length of the 

25 interval was present on more than one sequence in the cluster were identified, and intervals thus 
identified were considered to be equivalent by transitivity. For example, if an interval was present 
on a cDNA and two genomic sequences, then all three intervals were considered to be equivalent. 
This process allows unrelated but consecutive genomic sequences to be brought together, bridged by 
cDNA sequence. Intervals thus identified were then "stitched" together by the stitching algorithm 

30 in the order that they appear along their parent sequences to generate the longest possible sequence, 
as well as sequence variants. Linkages between intervals which proceed along one type of parent 
sequence (cDNA to cDNA or genomic sequence to genomic sequence) were given preference over 
linkages which change parent type (cDNA to genomic sequence). The resultant stitched sequences 
were translated and compared by BLAST analysis to the genpept and gbpri public databases. 

35 Incorrect exons predicted by Genscan were corrected by comparison to the top BLAST hit from 
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genpept. Sequences were further extended with additional cDNA sequences, or by inspection of 
genomic DNA, when necessary. 
"Stretched" Sequences 

Partial DNA sequences were extended to full length with an algorithm based on BLAST 

5 analysis. First, partial cDNAs assembled as described in Example III were queried against public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases 
using the BLAST program. The nearest GenBank protein homolog was then compared by BLAST 
analysis to either Incyte cDNA sequences or GenScan exon predicted sequences described in 
Example IV. A chimeric protein was generated by using the resultant high-scoring segment pairs 

10 (HSPs) to map the translated sequences onto the GenBank protein homolog. Insertions or deletions 
may occur in the chimeric protein with respect to the original GenBank protein homolog. The 
GenBank protein homolog, the chimeric protein, or both were used as probes to search for 
homologous genomic sequences from the public human genome databases. Partial DNA sequences 
were therefore "stretched" or extended by the addition of homologous genomic sequences. The 

15 resultant stretched sequences were examined to determine whether it contained a complete gene. 
VI. Chromosomal Mapping of TRICH Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:33-64 were compared with 
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched 

20 SEQ ID NO:33-64 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WIGR), and Gen6thon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the 

25 assignment of all sequences of that cluster, including its particular SEQ ID NO:, to that map 
location. 

Map locations are represented by ranges, or intervals, of human chromosomes. The map 
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's 
p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies 

30 between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of 
DNA in humans, although this can vary widely due to hot and cold spots of recombination.) The 
cM distances are based on genetic markers mapped by G6n6thon which provide boundaries for 
radiation hybrid markers whose sequences were included in each of the clusters. Human genome 
maps and other resources available to the public, such as the NCBI "GeneMap'99" World Wide 

35 Web site (http://www.ncbi.nbn.nih.gov/genemap/), can be employed to determine if previously 
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VII. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which 
5 RNAs from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; 
Ausubel (1995) supra , ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or 
related molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This 
analysis is much faster than multiple membrane-based hybridizations. In addition, the sensitivity of 
10 the computer search can be modified to determine whether any particular match is categorized as 
exact or similar. The basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 
5 * minimum {length(Seq. 1), length(Seq. 2)} 

15 

The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 
product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is 

20 calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 
(HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by 
gaps). If there is more than one HSP, then the pair with the highest BLAST score is used to 
calculate the product score. The product score represents a balance between fractional overlap and 
quality in a BLAST alignment For example, a product score of 100 is produced only for 100% 

25 identity over the entire length of the shorter of the two sequences being compared. A product score 
of 70 is produced either by 100% identity and 70% overlap at one end, or by 88% identity and 100% 
overlap at the other. A product score of 50 is produced either by 100% identity and 50% overlap at 
one end, or 79% identity and 100% overlap. 

Alternatively, polynucleotide sequences encoding TRICH are analyzed with respect to the 

30 tissue sources from which they were derived. For example, some full length sequences are 

assembled, at least in part, with overlapping Incyte cDNA sequences (see Example HI). Each cDNA 
sequence is derived from a cDNA library constructed from a human tissue. Each human tissue is 
classified into one of the following organ/tissue categories: cardiovascular system; connective 
tissue; digestive system; embryonic structures; endocrine system; exocrine glands; genitalia, female; 

35 genitalia, male; germ cells; hemic and immune system; liver; musculoskeletal system; nervous 
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system; pancreas; respiratory system; sense organs; skin; stomatognathic system; 
unclassified/mixed; or urinary tract. The number of libraries in each category is counted and 
divided by the total number of libraries across all categories. Similarly, each human tissue is 
classified into one of the following disease/condition categories: cancer, cell line, developmental, 

5 inflammation, neurological, trauma, cardiovascular, pooled, and other, and the number of libraries 
in each category is counted and divided by the total number of libraries across all categories. The 
resulting percentages reflect the tissue- and disease-specific expression of cDNA encoding TRICH. 
cDNA sequences and cDNA library/tissue information are found in the LIFESEQ GOLD database 
(hcyte Genomics, Palo Alto CA). 

10 MIL Extension of TRICH Encoding Polynucleotides 

Full length polynucleotide sequences were also produced by extension of an appropriate 
fragment of the full length molecule using oligonucleotide primers designed from this fragment. 
One primer was synthesized to initiate 5* extension of the known fragment, and the other primer was 
synthesized to initiate 3' extension of the known fragment. The initial primers were designed using 

15 OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target 
sequence at temperatures of about 68 °C to about 72°C Any stretch of nucleotides which would 
result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDN A libraries were used to extend the sequence. If more than one 

20 extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PGR using methods well known in the art. PCR 
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The 
reaction mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2 *, 
(NH 4 ) 2 S0 4 , and 2-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), 

25 ELONGASE enzyme (Life Technologies), and Pfu DNA polymerase (Stratagene), with the 

following parameters for primer pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; 
Step 3: 60 °C, 1 min; Step 4: 68 °C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68 °C, 
5 min; Step 7: storage at 4°C. In the alternative, the parameters for primer pair T7 and SK+ were as 
follows: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 57 °C, 1 min; Step 4: 68°C, 2 min; Step 

30 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. 

The concentration of DNA in each well was determined by dispensing 100 jil PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 fi\ of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Huoroskan II 

35 (Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
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concentration of DNA. A 5 iA to 10 /A aliquot of the reaction mixture was analyzed by 
electrophoresis on a 1 % agarose gel to determine which reactions were successful in extending the 
sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 

5 digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended 
clones were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector 

10 (Amersham Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction 
site overhangs, and transfected into competent E. coli cells. Transformed cells were selected on 
antibiotic-containing media, and individual colonies were picked and cultured overnight at 37 °C in 
384-well plates in LB/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 

15 (Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 

parameters: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; 
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was 
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low 
DNA recoveries were reamplified using the same conditions as described above. Samples were 

20 diluted with 20% dimethysulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer 
sequencing primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI 
PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 

In like manner, full length polynucleotide sequences are verified using the above procedure 
or are used to obtain 5' regulatory sequences using the above procedure along with oligonucleotides 

25 designed for such extension, and an appropriate genomic library. 
IX. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO: 33-64 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially the same procedure is used with larger nucleotide 

30 fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 mCi of 
[y- 32 P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). 

35 An aliquot containing 10 7 counts per minute of the labeled probe is used in a typical membrane- 



81 



WO 02/46415 



PCT/US01/46963 



based hybridization analysis of human genomic DNA digested with one of the following 
endonucieases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or Pvu II (DuPont NEK). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
5 hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

X. Microarrays 

10 The linkage or synthesis of array elements upon a microarray can be achieved utilizing 

photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweiler, supra .1. 
mechanical microspotting technologies, and derivatives thereof. The substrate in each of the 
aforementioned technologies should be uniform and solid with a non-porous surface (Schena 
(1999), supra) . Suggested substrates include silicon, silica, glass slides, glass chips, and silicon 

15 wafers. Alternatively, a procedure analogous to a dot or slot blot may also be used to arrange and 
link elements to the surface of a substrate using thermal, UV, chemical, or mechanical bonding 
procedures. A typical array may be produced using available methods and machines well known to 
those of ordinary skill in the art and may contain any appropriate number of elements. (See, e.g., 
Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645; 

20 Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof 
may comprise the elements of the microarray. Fragments or oligomers suitable for hybridization 
can be selected using software well known in the art such as LASERGENE software (DNASTAR). 
The array elements are hybridized with polynucleotides in a biological sample. The polynucleotides 

25 in the biological sample are conjugated to a fluorescent label or other molecular tag for ease of 
detection. After hybridization, nonhybridized nucleotides from the biological sample are removed, 
and a fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element 

30 on the microarray may be assessed. In one embodiment, microarray preparation and usage is 
described in detail below. 
Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
poly(A) + RNA is purified using the oligo-(dT) cellulose method. Each poly (A) 4 RNA sample is 

35 reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/ptl oligo-(dT) primer (21mer), IX 
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first strand buffer, 0.03 units//il RNase inhibitor, 500 /aM dATP, 500 pM dGTP, 500 /iM dTTP, 40 
fiU dCTP, 40 fiM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poly(A) + RNA with 
GEMBRIGHT kits (Incyte). Specific control poly(A) + RNAs are synthesized by in vitro 

5 transcription from non-coding yeast genomic DNA. After incubation at 37 °C for 2 hr, each reaction 
sample (one with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium 
hydroxide and incubated for 20 minutes at 85 °C to the stop the reaction and degrade the RNA. 
Samples are purified using two successive CHROMA SPIN 30 gel filtration spin columns 
(CLONTECH Laboratories, Inc. (CLONTECH), Palo Alto CA) and after combining, both reaction 

10 samples are ethanol precipitated using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 
ml of 100% ethanol. The sample is then dried to completion using a SpeedVAC (Savant 
Instruments Inc., Holbrook NY) and resuspended in 14 fi\ 5X SSC/0.2% SDS. 
Microarrav Preparation 

Sequences of the present invention are used to generate amy elements. Each array element 

15 is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification 
uses primers complementary to the vector sequences flanking the cDNA insert Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 
fig. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia 
Biotech). 

20 Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 

slides (Coming) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, 
and coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 

25 U0°Coven. 

Array elements are applied to the coated glass substrate using a procedure described in U.S. 
Patent No. 5,807,522, incorporated herein by reference. 1 /xl of the array element DNA, at an 
average concentration of 100 ng/jxl, is loaded into the open capillary printing element by a high- 
speed robotic apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

30 Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 

Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0,2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60° C followed by washes in 
0.2% SDS and distilled water as before. 

35 Hybridization 
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Hybridization reactions contain 9 fil of sample mixture consisting of 0.2 jig each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 
mixture is heated to 65° C for 5 minutes and is aliquoted onto the microarray surface and covered 
with an 1.8 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just 

5 slightly larger than a microscope slide. The chamber is kept at 100% humidity internally by the 
addition of 140 pi of 5X SSC in a corner of the chamber. The chamber containing the arrays is 
incubated for about 6.5 hours at 60° C. The arrays are washed for 10 rain at 45° C in a first wash 
buffer (IX SSC, 0.1% SDS), three times for 10 minutes each at 45°C in a second wash buffer (0.1X 
SSC), and dried. 

10 Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral 
lines at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light 
is focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 

15 containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with 
a resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomuitiplier tube detectors (PMT R1477, 

20 Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. 

Appropriate filters positioned between the array and the photomuitiplier tubes are used to filter the 
signals. The emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. 
Each array is typically scanned twice, one scan per fluorophore using the appropriate filters at the 
laser source, although the apparatus is capable of recording the spectra from both fluorophores 

25 simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location 
on the array contains a complementary DNA sequence, allowing the intensity of the signal at that 
location to be correlated with a weight ratio of hybridizing species of 1:100,000. When two 

30 samples from different sources (e.g., representing test and control cells), each labeled with a 

different fluorophore, are hybridized to a single array for the purpose of identifying genes that are 
differentially expressed, the calibration is done by labeling samples of the calibrating cDNA with 
the two fluorophores and adding identical amounts of each to the hybridization mixture. 

The output of the photomuitiplier tube is digitized using a 12-bit RTI-835H analog-to- 

35 digital (A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM- 
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compatible PC computer. The digitized data are displayed as an image where the signal intensity is 
mapped using a linear 20-color transformation to a pseudocolor scale ranging from blue (low 
signal) to red (high signal). The data is also analyzed quantitatively. Where two different 
fluorophores are excited and measured simultaneously, the data are first corrected for optical 
5 crosstalk (due to overlapping emission spectra) between the fluorophores using each fluorophore's 
emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each 
spot is centered in each element of the grid. The fluorescence signal within each element is then 
integrated to obtain a numerical value corresponding to the average intensity of the signal. The 
10 software used for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 

XI. Complementary Polynucleotides 

Sequences complementary to the TRICH-encoding sequences, or any parts thereof, are used 
to detect, decrease, or inhibit expression of naturally occurring TRICH. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 

15 procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are 
designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of TRICH. 
To inhibit transcription, a complementary oligonucleotide is designed from the most unique 5' 
sequence and used to prevent promoter binding to the coding sequence. To inhibit translation, a 
complementary oligonucleotide is designed to prevent ribosomal binding to the TRICH-encoding 

20 transcript. 

XII. Expression of TRICH 

Expression and purification of TRICH is achieved using bacterial or virus-based expression 
systems. For expression of TRICH in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 

25 transcription. Examples of such promoters include, but are not limited to, the trplac (tac) hybrid 
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express TRICH upon induction with isopropyl beta-D- 
thiogalactopyranoside (IPTG). Expression of TRICH in eukaryotic cells is achieved by infecting 

30 insect or mammalian cell lines with recombinant Autographica californica nuclear polyhedrosis 
virus (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of 
baculovirus is replaced with cDNA encoding TRICH by either homologous recombination or 
bacterial-mediated transposition involving transfer plasmid intermediates. Viral infectivity is 
maintained and the strong polyhedrin promoter drives high levels of cDNA transcription. 

35 Recombinant baculovirus is used to infect Spodoptera frugiperda (Sf9) insect cells in most cases, or 
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human hepatocytes, in some cases. Infection of the latter requires additional genetic modifications 
to baculovirus. (See Engelhard, E.K. et al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; 
Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945.) 

In most expression systems, TRICH is synthesized as a fusion protein with, e.g., glutathione 

5 S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26- 
kilodalton enzyme from Schistosoma iaponicum . enables the purification of fusion proteins on 
immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham 
Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved from 

10 TRICH at specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity 
purification using commercially available monoclonal and polyclonal anti-FLAG antibodies 
(Eastman Kodak). 6-His, a stretch of six consecutive histidine residues, enables purification on 
metal-chelate resins (QIAGEN)- Methods for protein expression and purification are discussed in 
Ausubel (1995, supra , ch. 10 and 16). Purified TRICH obtained by these methods can be used 

15 directly in the assays shown in Examples XVI, XVII, and XVm where applicable. 
XIII. Functional Assays 

TRICH function is assessed by expressing the sequences encoding TRICH at 
physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a 
mammalian expression vector containing a strong promoter that drives high levels of cDNA 

20 expression. Vectors of choice include PCMV SPORT (Life Technologies) and PCR3. 1 (Invitrogen, 
Carlsbad CA), both of which contain the cytomegalovirus promoter. 5-10 y.% of recombinant vector 
are transiently transfected into a human cell line, for example, an endothelial or hematopoietic cell 
line, using either liposome formulations, or electroporation, 1-2 fug of an additional plasmid 
containing sequences encoding a marker protein are co-transfected. Expression of a marker protein 

25 provides a means to distinguish transfected cells from nontransfected cells and is a reliable predictor 
of cDNA expression from the recombinant vector. Marker proteins of choice include, e.g., Green 
Fluorescent Protein (GFP; Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry 
(FCM), an automated, laser optics-based technique, is used to identify transfected cells expressing 
GFP or CD64-GFP and to evaluate the apoptotic state of the cells and other cellular properties. 

30 FCM detects and quantifies the uptake of fluorescent molecules that diagnose events preceding or 
coincident with cell death. These events include changes in nuclear DNA content as measured by 
staining of DNA with propidium iodide; changes in cell size and granularity as measured by forward 
light scatter and 90 degree side light scatter; down-regulation of DNA synthesis as measured by 
decrease in bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular 

35 proteins as measured by reactivity with specific antibodies; and alterations in plasma membrane 
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composition as measured by the binding of fluorescein-conjugated Annexin V protein to the cell 
surface. Methods in flow cytometry are discussed in Ormerod, M.G, (1994) Flow Cytometry . 
Oxford, New York NY. 

The influence of TRICH on gene expression can be assessed using highly purified 

5 populations of cells transfected with sequences encoding TRICH and either CD64 or CD64-GFP. 
CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved 
regions of human immunoglobulin G (IgG). Transfected cells are efficiently separated from 
nontransfected cells using magnetic beads coated with either human IgG or antibody against CD64 
(DYNAL, Lake Success NY). mRNA can be purified from the cells using methods well known by 

10 those of skill in the art. Expression of mRNA encoding TRICH and other genes of interest can be 
analyzed by northern analysis or microarray techniques. 
XTV. Production of TRICH Specific Antibodies 

TRICH substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used 

15 to immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the TRICH amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are 

20 well described in the ait (See, e.g., Ausubel, 1995, supra , ch. 11.) 

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 43 1A 
peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma- 
Aldrich, St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) 
to increase immunogenicity. (See, e.g., Ausubel, 1995, supra .) Rabbits are immunized with the 

25 oligopeptide-KLH complex in complete Freund's adjuvant. Resulting antisera are tested for 

antipeptide and anti-TRICH activity by, for example, binding the peptide or TRICH to a substrate, 
blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated 
goat anti-rabbit IgG. 

XV. Purification of Naturally Occurring TRICH Using Specific Antibodies 
30 Naturally occurring or recombinant TRICH is substantially purified by immunoaffinity 

chromatography using antibodies specific for TRICH: An immunoaffinity column is constructed by 
covalently coupling anti-TRICH antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 
35 Media containing TRICH are passed over the immunoaffinity column, and the column is 



87 



WO 02/46415 



PCT/US01/46963 



washed under conditions that allow the preferential absorbance of TRICH (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/TRICH binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, 
such as urea or thiocyanate ion), and TRICH is collected. 

5 XVI. Identification of Molecules Which Interact with TRICH 

Molecules which interact with TRICH may include transporter substrates, agonists or 
antagonists, modulatory proteins such as Gffy proteins (Reimann, supra) or proteins involved in 
TRICH localization or clustering such as MAGUKs (Craven, supra) . TRICH, or biologically active 
fragments thereof, are labeled with 125 I Bolton-Hunter reagent. (See, e.g., Bolton A.E. and W.M. 

10 Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules previously arrayed in the wells of a 
multi-well plate are incubated with the labeled TRICH, washed, and any wells with labeled TRICH 
complex are assayed. Data obtained using different concentrations of TRICH are used to calculate 
values for the number, affinity, and association of TRICH with the candidate molecules. 

Alternatively, proteins that interact with TRICH are isolated using the yeast 2-hybrid system 

15 (Fields, S. and O. Song (1989) Nature 340:245-246). TRICH, or fragments thereof, are expressed as 
fusion proteins with the DNA binding domain of Gal4 or lexA, and potential interacting proteins are 
expressed as fusion proteins with an activation domain. Interactions between the TRICH fusion 
protein and the TRICH interacting proteins (fusion proteins with an activation domain) reconstitute 
a transactivation function that is observed by expression of a reporter gene. Yeast 2-hybrid systems 

20 are commercially available, and methods for use of the yeast 2-hybrid system with ion channel 
proteins are discussed in Niethammer, M. and M. Sheng (1998, Meth. Enzymol. 293:104-122). 

TRICH may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all 
interactions between the proteins encoded by two large libraries of genes (Nandabalan, K. et aL 

25 (2000) U.S. Patent No. 6,057,101). 

Potential TRICH agonists or antagonists may be tested for activation or inhibition of 
TRICH ion channel activity using the assays described in section XVHL 
XVII. Demonstration of TRICH Activity 

Ion channel activity of TRICH is demonstrated using an electrophysiological assay for ion 

30 conductance. TRICH can be expressed by transforming a mammalian cell line such as COS7, HeLa 
or CHO with a eukaryotic expression vector encoding TRICH. Eukaryotic expression vectors are 
commercially available, and the techniques to introduce them into cells are well known to those 
skilled in the art. A second plasmid which expresses any one of a number of marker genes, such as 
fi-galactosidase, is co-transformed into the cells to allow rapid identification of those cells which 

35 have taken up and expressed the foreign DNA. The cells are incubated for 48-72 hours after 
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transformation under conditions appropriate for the cell line to allow expression and accumulation 
of TRICH and B-galactosidase. 

Transformed cells expressing B-galactosidase are stained blue when a suitable colorimetric 
substrate is added to the culture media under conditions that are well known in the art. Stained cells 

5 are tested for differences in membrane conductance by electrophysiological techniques that are well 
known in the art. Untransfonned cells, and/or cells transformed with either vector sequences alone 
or B-galactosidase sequences alone, are used as controls and tested in parallel. Cells expressing 
TRICH will have higher anion or cation conductance relative to control cells. The contribution of 
TRICH to conductance can be confirmed by incubating the cells using antibodies specific for 

10 TRICH. The antibodies will bind to the extracellular side of TRICH, thereby blocking the pore in 
the ion channel, and the associated conductance. 

Alternatively, ion channel activity of TRICH is measured as current flow across a TRICH- 
containing Xenopus laevis oocyte membrane using the two-electrode voltage-clamp technique (Ishi 
et al., supra ; Jegla, T. and L. Salkoff (1997) J. Neurosci. 17:32-44). TRICH is subcloned into an 

15 appropriate Xenopus oocyte expression vector, such as pBF, and 0.5-5 ng of mRNA is injected into * 
mature stage IV oocytes. Injected oocytes are incubated at 18°Cfor 1-5 days. Inside-out 
macropatches are excised into an intracellular solution containing 116 mM K-gluconate, 4 mM KC1, 
and 10 mM Hepes (pH 7.2). The intracellular solution is supplemented with varying concentrations 
of the TRICH mediator, such as cAMP, cGMP, or Ca +2 (in the form of CaCl^, where appropriate. 

20 Electrode resistance is set at 2-5 MQ and electrodes are filled with the intracellular solution lacking 
mediator. Experiments are performed at room temperature from a holding potential of 0 mV. 
Voltage ramps (2.5 s) from -100 to 100 mV are acquired at a sampling frequency of 500 Hz. 
Current measured is proportional to the activity of TRICH in the assay. In particular, the activity of 
TRICH-1 is measured as Na* conductance and the activity of TRICH-3 is measured as Ca 2+ 

25 conductance. 

Transport activity of TRICH is assayed by measuring uptake of labeled substrates into 
Xenopus laevis oocytes. Oocytes at stages V and VI are injected with TRICH mRNA (10 ng per 
oocyte) and incubated for 3 days at 18°C in OR2 medium (82.5mM NaCl, 2.5 mM KC1, ImM CaCl 
2t ImM MgCl 2 , ImM Na 2 HP0 4 , 5 mM Hepes, 3.8 mM NaOH , 50/ig/ml gentamycin, pH 7.8) to 

30 allow expression of TRICH. Oocytes are then transferred to standard uptake medium (lOOmM 
NaCl, 2 mM KC1, ImM CaCl 2 , ImM MgCl 2 , 10 mM Hepes/Tris pH 7.5). Uptake of various 
substrates (e.g., amino acids, sugars, drugs, ions, and neurotransmitters) is initiated by adding 
labeled substrate (e.g. radiolabeled with 3 H, fluorescently labeled with rhodamine, etc.) to the 
oocytes. After incubating for 30 minutes, uptake is terminated by washing the oocytes three times 

35 in Na*-free medium, measuring the incorporated label, and comparing with controls. TRICH 
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activity is proportional to the level of internalized labeled substrate. Test substrates include ran- 
GTP for TRICH- 2, glucose for TRICH-5, amino acids for TRICH-6 and TRICH-14, cations for 
TRICH-7 and TRICH-16, Na + , K + and CI* ions for TRICH-15, reduced folate or analogues such as 
methotrexate for TRICH-17, divalent cations for TRICH-18, anions such as arsenate and antimonite 

5 for TRICH-19, and nitrate or oligopeptides for TRICH-20. 

ATPase activity associated with TRICH can be measured by hydrolysis of radiolabeled 
ATP-[y- 32 P], separation of the hydrolysis products by chromatographic methods, and quantitation of 
the recovered 32 P using a scintillation counter. The reaction mixture contains ATP-[y- 32 P] and 
varying amounts of TRICH in a suitable buffer incubated at 37 °C for a suitable period of time. The 

10 reaction is terminated by acid precipitation with trichloroacetic acid and then neutralized with base, 
and an aliquot of the reaction mixture is subjected to membrane or filter paper-based 
chromatography to separate the reaction products. The amount of 32 P liberated is counted in a 
scintillation counter. The amount of radioactivity recovered is proportional to the ATPase activity 
of TRICH in the assay. 

15 Lipocalin activity of TRICH is measured by ligand fluorescence enhancement 

spectrofluorometry (Lin et al. (1997) Molecular Vision 3:17). Examples of ligands include retinol 
(Sigma, St. Louis MO) and 16-anthryloxy-palmitic acid (16-AP) (Molecular Probes Inc., Eugene 
OR). Ligand is dissolved in 100% ethanol and its concentration is estimated using known 
extinction coefficents (retinol: 46,000 A/M/cm at 325 nra; 16-AP: 8,200 A/M/cm at 361 nm). A 

20 700 nl aliquot of 1 /xM TRICH in 10 mM Tris (pH 7.5), 2 mM EDTA, and 500 mM NaCl is placed 
in a 1 cm path length quartz cuvette and 1 fi\ aliquots of ligand solution are added. Fluorescence is 
measured 100 seconds after each addition until readings are stable. Change in fluorescence per unit 
change in ligand concentration is proportional to TRICH activity. 
XVm. Identification of TRICH Agonists and Antagonists 

25 TRICH is expressed in a eukaryotic cell line such as CHO (Chinese Hamster Ovary) or 

HEK (Human Embryonic Kidney) 293. Ion channel activity of the transformed cells is measured in 
the presence and absence of candidate agonists or antagonists. Ion channel activity is assayed using 
patch clamp methods well known in the art or as described in Example XVII. Alternatively, ion 
channel activity is assayed using fluorescent techniques that measure ion flux across the cell 

30 membrane (Velicelebi, G. et al. (1999) Meth. Enzymol. 294:20-47; West, M.R. and C.R. Moiloy 
(1996) Anal. Biochem. 241:51-58). These assays may be adapted for high-throughput screening 
using microplates. Changes in internal ion concentration are measured using fluorescent dyes such 
as the Ca 2+ indicator Fluo-4 AM, sodium-sensitive dyes such as SBFI and sodium green, or the CI" 
indicator MQAE (all available from Molecular Probes) in combination with the FLIPR fluorimetric 

35 plate reading system (Molecular Devices). In a more generic version of this assay, changes in 
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membrane potential caused by ionic flux across the plasma membrane are measured using oxonyl 
dyes such as DiBAC 4 (Molecular Probes). DiBAC 4 equilibrates between the extracellular solution 
and cellular sites according to the cellular membrane potential. The dye's fluorescence intensity is 
20-fold greater when bound to hydrophobic intracellular sites, allowing detection of DiBAC 4 entry 

5 into the cell (Gonzalez, J.E. and P.A. Negulescu (1998) Curr. Opin. Biotechnol. 9:624-631). 

Candidate agonists or antagonists may be selected from known ion channel agonists or antagonists, 
peptide libraries, or combinatorial chemical libraries. 

Various modifications and variations of the described methods and systems of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the 

10 invention. Although the invention has been described in connection with certain embodiments, it 
should be understood that the invention as claimed should not be unduly limited to such specific 
embodiments. Indeed, various modifications of the described modes for carrying out the invention 
which are obvious to those skilled in molecular biology or related fields are intended to be within 
the scope of the following claims. 
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What is claimed is: 

i 

1. An isolated polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence selected from the group 
5 consisting of SEQ ID NO: 1-32, 

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32, 

c) a biologically active fragment of a polypeptide having an amino acid sequence 
10 selected from the group consisting of SEQ ID NO: 1 -32, and 

d) an immunogenic fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32. 



2. An isolated polypeptide of claim 1 comprising an amino acid sequence selected from the 
15 group consisting of SEQ ID NO: 1-32. 



3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

20 

5. An isolated polynucleotide of claim 4 comprising a polynucleotide sequence selected 
from the group consisting of SEQ ID NO:33-64. 



6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
25 polynucleotide of claim 3 . 



7. A cell transformed with a recombinant polynucleotide of claim 6. 



8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

30 

9. A method of producing a polypeptide of claim 1, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein 
said cell is transformed with a recombinant polynucleotide, and said recombinant 
polynucleotide comprises a promoter sequence operably linked to a polynucleotide 
35 encoding the polypeptide of claim 1, and 
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b) recovering the polypeptide so expressed. 

10. A method of claim 9, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32. 

1 1. An isolated antibody which specifically binds to a polypeptide of claim 1. 

12. An isolated polynucleotide selected from the group consisting of: 

a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:33-64, 

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected from the group consisting of 
SEQIDNO:33-64, 

c) a polynucleotide complementary to a polynucleotide of a), 

d) a polynucleotide complementary to a polynucleotide of b), and 

e) an RNA equivalent of a)-d). 

13. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 12. 

14. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, 
and which probe specifically hybridizes to said target polynucleotide, under 
conditions whereby a hybridization complex is formed between said probe and said 
target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 

15. A method of claim 14, wherein the probe comprises at least 60 contiguous nucleotides. 

16. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain 
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10 



reaction amplification, and 
b) detecting the presence or absence of said amplified target polynucleotide or 
fragment thereof, and, optionally, if present, the amount thereof. 

17. A composition comprising a polypeptide of claim 1 and a pharmaceutical^ acceptable 
excipient. 

18. A composition of claim 17, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32. 

19. A method for treating a disease or condition associated with decreased expression of 
functional TRICH, comprising administering to a patient in need of such treatment the 
composition of claim 17. 

15 20. A method of screening a compound for effectiveness as an agonist of a polypeptide of 

claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

20 21. A composition comprising an agonist compound identified by a method of claim 20 and 

a pharmaceutically acceptable excipient. 

22. A method for treating a disease or condition associated with decreased expression of 
functional TRICH, comprising administering to a patient in need of such treatment a 

25 composition of claim 21 . 

23. A method of screening a compound for effectiveness as an antagonist of a polypeptide 
of claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 
30 b) detecting antagonist activity in the sample. 

24. A composition comprising an antagonist compound identified by a method of claim 23 
and a pharmaceutically acceptable excipient. 

35 25. A method for treating a disease or condition associated with overexpression of 
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functional TRICH, comprising administering to a patient in need of such treatment a 
composition of claim 24. 

26. A method of screening for a compound that specifically binds to the polypeptide of 
5 claim 1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under 
suitable conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby 
identifying a compound that specifically binds to the polypeptide of claim 1. 

10 

27. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under 
conditions permissive for the activity of the polypeptide of claim 1, 
15 b) assessing the activity of the polypeptide of claim 1 in the presence of the test 

compound, and 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test 
compound with the activity of the polypeptide of claim 1 in the absence of the test 
compound, wherein a change in the activity of the polypeptide of claim 1 in the 

20 presence of the test compound is indicative of a compound that modulates the 

activity of the polypeptide of claim 1. 

28. A method of screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the 

25 method comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under 
conditions suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing the expression of the target polynucleotide in the presence of varying 
30 amounts of the compound and in the absence of the compound. 

29. A method of assessing toxicity of a test compound, the method comprising: 

a) treating a biological sample containing nucleic acids with the test compound, 

b) hybridizing the nucleic acids of the treated biological sample with a probe 

35 comprising at least 20 contiguous nucleotides of a polynucleotide of claim 12 under 



135 



WO 02/46415 



PCT/US01/46963 



conditions whereby a specific hybridization complex is formed between said probe 
and a target polynucleotide in the biological sample, said target polynucleotide 
comprising a polynucleotide sequence of a polynucleotide of claim 12 or fragment 
thereof, 

5 c) quantifying the amount of hybridization complex, and 

d) comparing the amount of hybridization complex in the treated biological sample 
with the amount of hybridization complex in an untreated biological sample, 
wherein a difference in the amount of hybridization complex in the treated 
biological sample is indicative of toxicity of the test compound. 

10 

30. A diagnostic test for a condition or disease associated with the expression of TRICH in 
a biological sample, the method comprising: 

a) combining the biological sample with an antibody of claim 11, under conditions 
suitable for the antibody to bind the polypeptide and form an antibody:polypeptide 

15 complex, and 

b) detecting the complex, wherein the presence of the complex correlates with the 
presence of the polypeptide in the biological sample. 



31. The antibody of claim 11, wherein the antibody is: 
20 a) a chimeric antibody, 

b) a single chain antibody, 

c) a Fab fragment, 

d) a F(ab') 2 fragment, or 

e) a humanized antibody. 



25 



32. A composition comprising an antibody of claim 11 and an acceptable excipient. 



33. A method of diagnosing a condition or disease associated with the expression of 
TRICH in a subject, comprising administering to said subject an effective amount of the 

30 composition of claim 32. 

34. A composition of claim 32, wherein the antibody is labeled. 



35. A method of diagnosing a condition or disease associated with the expression of 
35 TRICH in a subject, comprising administering to said subject an effective amount of the 
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composition of claim 34. 

36. A method of preparing a polyclonal antibody with the specificity of the antibody of 
claim 1 1, the method comprising: 

5 a) immunizing an animal with a polypeptide consisting of an amino acid sequence 

selected from the group consisting of SEQ ID NO: 1-32, or an immunogenic 
fragment thereof, under conditions to elicit an antibody response, 

b) isolating antibodies from said animal, and 

c) screening the isolated antibodies with the polypeptide, thereby identifying a 

10 polyclonal antibody which binds specifically to a polypeptide comprising an amino 

acid sequence selected from the group consisting of SEQ ID NO: 1-32. 

37. A polyclonal antibody produced by a method of claim 36. 

15 38. A composition comprising the polyclonal antibody of claim 37 and a suitable carrier. 

39. A method of making a monoclonal antibody with the specificity of the antibody of 
claim 1 1, the method comprising: 

a) immunizing an animal with a polypeptide consisting of an amino acid sequence 
20 selected from the group consisting of SEQ ID NO: 1-32, or an immunogenic 

fragment thereof, under conditions to elicit an antibody response, 

b) isolating antibody producing cells from the animal, 

c) fusing the antibody producing cells with immortalized cells to form monoclonal 
antibody-producing hybridoma cells, 

25 d) culturing the hybridoma cells, and 

e) isolating from the culture monoclonal antibody which binds specifically to a 

polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-32. 

30 40. A monoclonal antibody produced by a method of claim 39. 

41. A composition comprising the monoclonal antibody of claim 40 and a suitable carrier. 



35 



42. The antibody of claim 11, wherein the antibody is produced by screening a Fab 
expression library. 

137 



WO 02/46415 



PCT/US01/46963 



43. The antibody of claim 11, wherein the antibody is produced by screening a recombinant 
immunoglobulin library. 

44. A method of detecting a polypeptide comprising an amino acid sequence selected from 
5 the group consisting of SEQ ID NO: 1-32 in a sample, the method comprising: 

a) incubating the antibody of claim 11 with a sample under conditions to allow 
specific binding of the antibody and the polypeptide, and 

b) detecting specific binding, wherein specific binding indicates the presence of a 
polypeptide comprising an amino acid sequence selected from the group consisting 

10 of SEQ ID NO: 1-32 in the sample. 

45. A method of purifying a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-32 from a sample, the method comprising: 

a) incubating the antibody of claim 1 1 with a sample under conditions to allow 
15 specific binding of the antibody and the polypeptide, and 

b) separating the antibody from the sample and obtaining the purified polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32. 

20 46. A microarray wherein at least one element of the microarray is a polynucleotide of 

claim 13. 

47. A method of generating an expression profile of a sample which contains 
polynucleotides, the method comprising: 

25 a) labeling the polynucleotides of the sample, 

b) contacting the elements of the microarray of claim 46 with the labeled 
polynucleotides of the sample under conditions suitable for the formation of a 
hybridization complex, and 

c) quantifying the expression of the polynucleotides in the sample. 

30 

48. An array comprising different nucleotide molecules affixed in distinct physical 
locations on a solid substrate, wherein at least one of said nucleotide molecules comprises a 
first oligonucleotide or polynucleotide sequence specifically hybridizable with at least 30 

contiguous nucleotides of a target polynucleotide, and wherein said target polynucleotide is 
35 a polynucleotide of claim 12. 
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49. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 30 contiguous nucleotides of said target 
polynucleotide. 

5 50. An anay of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 

completely complementary to at least 60 contiguous nucleotides of said target 
polynucleotide. 

5 1 . An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to said target polynucleotide. 

52. An array of claim 48, which is a microarray. 

53. An array of claim 48, further comprising said target polynucleotide hybridized to a 
nucleotide molecule comprising said first oligonucleotide or polynucleotide sequence. 

54. An array of claim 48, wherein a linker joins at least one of said nucleotide molecules to 
said solid substrate. 

55. An array of claim .48, wherein each distinct physical location on the substrate contains 
multiple nucleotide molecules, and the multiple nucleotide molecules at any single distinct 
physical location have the same sequence, and each distinct physical location on the 

te contains nucleotide molecules having a sequence which differs from the sequence 
nucleotide molecules at another distinct physical location on the substrate. 

56. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO: 1 . 

57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:2. 
30 58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:3. 

59. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:4. 



10 



15 



20 



substrat 
of 

25 



35 



60. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:5. 
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61. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:6, 

62. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:7. 
5 63. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO:8. 

64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:9. 

65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 10. 

10 

66. A polypeptide of claim 1 , comprising die amino acid sequence of SEQ ID NO: 1 1 . 

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 12. 
15 68. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 13. 

69. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 14. 

70. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 15. 

20 

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 16. 

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 17. 
25 73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 18. 

74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 19. 

75. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:20. 

30 

76. A polypeptide of claim 1, comprising the amino acid sequence of SEQ JD NO:21. 

77. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22. 

35 78. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 23. 
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79. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:24. 

80. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:25. 

81. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:26. 

82. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:27. 

83. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:28. 

84. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:29. 

85. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:30. 
15 86. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:31. 

87. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO.32. 

S8. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
20 NO:33. 

89. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:34. 

25 90. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:35. 

91. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:36. 

30 

92. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:37. 

93. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
35 NO:38. 
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94. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:39. 

95. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
5 NO:40. 

96. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO-.41. 

10 97. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:42. 

98. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:43. 

15 

99. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:44. 

100. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID' 
20 NO:45. 

» 

101. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:46. 

25 102. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:47. 

103. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:48. 

30 

104. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:49. 

105. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
35 NO:50. 
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106. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:51. 

107. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
5 NO:52. 

108. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:53. 

10 109. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:54. 

1 10. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:55. 

15 

1 1 1 . A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:56. 

1 12. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
20 NO:57. 

1 13. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:58. 

25 1 14. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:59. 

115. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:60. 

30 

1 16. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:61. 

1 17. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
35 NO:62. 
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1 18. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:63. 

1 19. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
5 NO:64. 
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<213> Homo sapiens 
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<221> misc_feature 

<223> Incyte ID No: 7484831CD1 

<400> 1 

Met Glu Pro Gly Asp Ala Ala Arg Pro Gly Ser Gly Arg Ala Thr 

15 10 15 

Gly Ala Pro Pro Pro Arg Leu Leu Leu Leu Pro Leu Leu Leu Gly 

20 25 30 

Trp Gly Leu Arg Val Ala Ala Ala Ala Ser Ala Ser Ser Ser Gly 
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35 


40 




45 


Ala 


Ala 


Ala 


Glu 


Asp 


ber ser Ala Met Glu uiu uexx Aia 


Thr 


Glu 










50 


55 




60 


Lys 


Glu 


Ala 


Glu 


Glu 


ber His Arg Gin Asp ber vai ber 


Leu 


Leu 










65 


70 




75 


Thr 


Phe 


He 


Leu 


Leu 


Leu Thr Leu Thr He Leu Thr He 


Trp Leu 










80 


85 




90 


Phe 


Lys 


His 


Arg 


Arg 


Val Arg Phe Leu His Glu Thr Gly 


Leu 


Ala 










95 


100 




105 


Met 


lie 


Tyr 


Gly 


Leu 


He Val Gly Val He Leu Arg Tyr 


Gly Thr 










110 


115 




120 


Pro 


Ala 


Thr 


Ser 


Gly 


Arg Asp Lys Ser Leu Ser Cys Thr 


Gin 


Glu 










125 


130 




135 


Asp 


Arg 


Ala 


Phe 


Ser 


Thr Leu Leu Val Asn Val Ser Gly 


Lys 


Phe 










140 


145 




150 


Phe 


Glu 


Tyr 


Thr 


Leu 


Lys Gly Glu He Ser Pro Gly Lys 


He 


Asn 










155 


160 




165 


Ser 


Val 


Glu 


Gin 


Asn 


Asp Met Leu Arg Lys Val Thr Phe 


Asp 


Pro 










170 


175 




180 


Glu 


Val 


Phe 


Phe 


Asn 


He Leu Leu Pro Pro He He Phe 


His 


Ala 










185 


190 




195 


Gly 


Tyr 


Ser 


Leu 


Lys 


Lys Arg His Phe Phe Arg Asn Leu 


Gly Ser 










200 


205 




210 


lie 


Leu 


Ala 


Tyr 


Ala 


Phe Leu Gly Thr Ala Val Ser Cys 


Phe 


lie 










215 


220 




225 


lie 


Gly 


Asn 


Leu 


Met 


Tyr Gly Val Val Lys Leu Met Lys 


He 


Met 










230 


235 




240 


Gly 


Gin 


Leu 


Ser 


Asp 


Lys Phe Tyr Tyr Thr Asp Cys Leu 


Phe 


Phe 










245 


250 




255 


Gly 


Ala 


He 


He 


Ser 


Ala Thr Asp Pro Val Thr Val Leu 


Ala 


lie 










260 


265 




270 


Phe 


Asn 


Glu 


Leu 


His 


Ala Asp Val Asp Leu Tyr Ala Leu 


Leu 


Phe 










275 


280 




285 


Gly 


Glu 


Ser 


Val 


Leu 


Asn Asp Ala Val Ala He Val Leu 


Ser 


Ser 










290 


295 




300 


Ser 


He 


Val 


Ala 


Tyr 


Gin Pro Ala Gly Leu Asn Thr His 


Ala 


Phe 










305 


310 




315 


Asp 


Ala 


Ala 


Ala 


Phe 


Phe Lys Ser Val Gly He Phe Leu 


Gly lie 










320 


325 




330 


Phe 


Ser 


Gly 


Ser 


Phe 


Thr Met Gly Ala Val Thr Gly Val 


Val 


Thr 










335 


340 




345 


Ala 


Leu 


Val 


Thr 


Lys 


Phe Thr Lys Leu His Cys Phe Pro 


Leu 


Leu 










350 


355 




360 


Glu 


Thr 


Ala 


Leu 


Phe 


Phe Leu Met Ser Trp Ser Thr Phe 


Leu 


Leu 










365 


370 




375 


Ala 


Glu 


Ala 


Cys 


Gly 


Phe Thr Gly Val Val Ala Val Leu 


Phe 


Cys 










380 


385 




390 


Gly 


He 


Thr 


Gin 


Ala 


His Tyr Thr Tyr Asn Asn Leu Ser 


Val 


Glu 








395 


400 




405 


Ser 


Arg 


Ser 


Arg 


Thr 


Lys Gin Leu Phe Glu Val Leu His 


Phe 


Leu 










410 


415 




420 


Ala 


Glu 


Asn 


Phe 


He 


Phe Ser Tyr Met Gly Leu Ala Leu 


Phe 


Thr 










425 


430 




435 


Phe 


Gin 


Lys 


His 


Val 


Phe Ser Pro He Phe lie lie Gly 


Ala 


Phe 








440 


445 




450 


Val 


Ala 


He 


Phe 


Leu 


Gly Arg Ala Ala His lie Tyr Pro 


Leu 


Ser 










455 


460 




465 


Phe 


Phe 


Leu 


Asn 


Leu 


Gly Arg Arg His Lys lie Gly Trp 


Asn 


Phe 










470 


475 




480 


Gin 


His 


Met 


Met 


Met 


Phe Ser Gly Leu Arg Gly Ala Met 


Ala 


Phe 










485 


490 




495 


Ala 


Leu 


Ala 


He 


Arg 


Asp Thr Ala Ser Tyr Ala Arg Gin 


Met 


Met 










500 


505 




510 
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Phe Thr Thr 
Gly Gly Gly 
Gly Val Glu 
Gin Tyr Phe 
Asn Asn Asp 
Ala Arg Gly 
Leu Trp Tyr 
His Ser Gly 
Leu Leu Ala 
Glu Pro Leu 
Asp Leu Thr 
Ser Ser Ser 
Arg Thr Lys 
Met Gly Asp 
Pro Leu Glu 



Thr Leu 

515 
Thr Thr 

530 
Glu Pro 

545 
Arg Val 

560 
Ser Phe 

575 
Asn Arg 

590 
Ser Phe 

605 
Pro Pro 

620 
Arg Cys 

635 
Arg Glu 

650 
Leu Thr 

665 
Ser His 

680 
Ser Ser 

695 
Gin Lys 

710 
Asp Asn 

725 



Leu He Val 
Pro Met Leu 
Ser Glu Glu 
Gly Val Asp 
Gin Val Leu 
Thr Lys Gin 
Asp His Asn 
Leu Thr Thr 
Leu Thr Ser 
Glu Asp Ser 
Tyr Gly Asp 
Thr Ala Ser 
Ser Glu Glu 
Val Ser Ser 
Ala 



Phe Phe 

520 
Ser Trp 

535 
Asp Gin 

550 
Pro Asp 

565 
Gin Gly 

580 
Glu Ser 

595 
Tyr Leu 

610 
Thr Leu 

625 
Pro Gin 

640 
Asp Phe 

655 
Ser Thr 

670 
Thr Ser 

685 
Val Leu 

700 
Arg Gly 

715 



Thr Val 
Leu Asn 
Asn Glu 
Gin Asp 
Asp Gly 
Thr Trp 
Lys Pro 
Pro Ala 
Val Tyr 
He Leu 
Val Thr 
Leu Glu 
Glu Arg 
Thr Arg 



Trp He He 
525 

He Arg Val 
540 

His His Trp 
555 

Pro Pro Pro 
570 

Pro Asp Ser 
585 

lie Phe Arg 
600 

He Leu Thr 
615 

Trp Cys Gly 
630 

Asp Asn Gin 
645 

Thr Glu Gly 
660 

Ala Asn Gly 
675 

Gly Ser Arg 
690 

Asp Leu Gly 
705 

Leu Val Phe 
720 



<210> 2 

<211> 1081 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2477266CD1 



<400> 2 














Met 


Glu 


Ser 


Ala 


Gly 


Leu 


Glu 


Gin Leu Leu Arg Glu Leu Leu Leu 


1 








5 






10 15 


Pro 


Asp 


Thr 


Glu 


Arg 
20 


He 


Arg 


Arg Ala Thr Glu Gin Leu Gin He 
25 30 


Val 


Leu 


Arg 


Ala 


Pro 
35 


Ala 


Ala 


Leu Pro Ala Leu Cys Asp Leu Leu 
40 45 


Ala 


Ser 


Ala 


Ala 


Asp 
50 


Pro 


Gin 


He Arg Gin Phe Ala Ala Val Leu 
55 60 


Thr 


Arg 


Arg 


Arg 


Leu 
65 


Asn 


Thr 


Arg Trp Arg Arg Leu Ala Ala Glu 
70 75 


Gin 


Arg 


Glu 


Ser 


Leu 


Lys 


Ser 


Leu He Leu Thr Ala Leu Gin Arg 








80 






85 90 


Glu 


Thr 


Glu 


His 


Cys 
95 


Val 


Ser 


Leu Ser Leu Ala Gin Leu Ser Ala 
100 105 


Thr 


He 


Phe 


Arg 


Lys 
110 


Glu 


Gly 


Leu Glu Ala Trp Pro Gin Leu Leu 
115 120 


Gin 


Leu 


Leu 


Gin 


His 
125 


Ser 


Thr 


His Ser Pro His Ser Pro Glu Arg 
130 135 


Glu 


Met 


Gly 


Leu 


Leu 


Leu 


Leu 


Ser Val Val Val Thr Ser Arg Pro 








140 






145 150 


Glu 


Ala 


Phe 


Gin 


Pro 
155 


His 


His 


Arg Glu Leu Leu Arg Leu Leu Asn 
160 165 
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Glu Thr Leu Gly Glu Val Gly Ser Pro Gly Leu Leu Phe Tyr Ser 

170 175 180 

Leu Arg Thr Leu Thr Thr Met Ala Pro Tyr Leu Ser Thr Glu Asp 

185 190 195 

Val Pro Leu Ala Arg Met Leu Val Pro Lys Leu He Met Ala Met 

200 205 210 

Gin Thr Leu He Pro He Asp Glu Ala Lys Ala Cys Glu Ala Leu 

215 220 225 

Glu Ala Leu Asp Glu Leu Leu Glu Ser Glu Val Pro Val He Thr 

230 235 240 

Pro Tyr Leu Ser Glu Val Leu Thr Phe Cys Leu Glu Val Ala Arg 

245 250 255 

Asn Val Ala Leu Gly Asn Ala He Arg He Arg He Leu Cys Cys 

260 265 270 

Leu Thr Phe Leu Val Lys Val Lys Ser Lys Ala Leu Leu Lys Asn 

275 280 285 

Arg Leu Leu Pro Pro Leu Leu His Thr Leu Phe Pro lie Val Ala 

290 295 300 

Ala Glu Pro Pro Pro Gly Gin Leu Asp Pro Glu Asp Gin Asp Ser 

305 310 315 

Glu Glu Glu Glu Leu Glu He Glu Leu Met Gly Glu Thr Pro Lys 

320 325 330 

His Phe Ala Val Gin Val Val Asp Met Leu Ala Leu His Leu Pro 

335 340 345 

Pro Glu Lys Leu Cys Pro Gin Leu Met Pro Met Leu Glu Glu Ala 

350 355 360 

Leu Arg Ser Glu Ser Pro Tyr Gin Arg Lys Ala Gly Leu Leu Val 

365 370 375 

Leu Ala Val Leu Ser Asp Gly Ala Gly Asp His He Arg Gin Arg 

380 385 390 

Leu Leu Pro Pro Leu Leu Gin He Val Cys Lys Gly Leu Glu Asp 

395 400 405 

Pro Ser Gin Val Val Arg Asn Ala Ala Leu Phe Ala Leu Gly Gin 

410 415 420 

Phe Ser Glu Asn Leu Gin Pro His He Ser Ser Tyr Ser Arg Glu 

425 430 435 

Val Met Pro Leu Leu Leu Ala Tyr Leu Lys Ser Val Pro Leu Gly 

440 445 450 

His Thr His His Leu Ala Lys Ala Cys Tyr Ala Leu Glu Asn Phe 

455 460 465 

Val Glu Asn Leu Gly Pro Lys Val Gin Pro Tyr Leu Pro Glu Leu 

470 475 480 

Met Glu Cys Met Leu Gin Leu Leu Arg Asn Pro Ser Ser Pro Arg 

485 490 495 

Ala Lys Glu Leu Ala Val Ser Ala Leu Gly Ala He Ala Thr Ala 

500 505 510 

Ala Gin Ala Ser Leu Leu Pro Tyr Phe Pro Ala He Met Glu His 

515 520 525 

Leu Arg Glu Phe Leu Leu Thr Gly Arg Glu Asp Leu Gin Pro Val 

530 535 540 

Gin He Gin Ser Leu Glu Thr Leu Gly Val Leu Ala Arg Ala Val 

545 550 555 

Gly Glu Pro Met Arg Pro Leu Ala Glu Glu Cys Cys Gin Leu Gly 

560 565 570 

Leu Gly Leu Cys Asp Gin Val Asp Asp Ala Asp Leu Arg Arg Cys 

575 580 585 

Thr Tyr Ser Leu Phe Ala Ala Leu Ser Gly Leu Met Gly Glu Gly 

590 595 600 

Leu Ala Pro His Leu Glu Gin He Thr Thr Leu Met Leu Leu Ser 

605 610 615 

Leu Arg Ser Thr Glu Gly He Val Pro Gin Tyr Asp Gly Ser Ser 

620 625 630 

Ser Phe Leu Leu Phe Asp Asp Glu Ser Asp Gly Glu Glu Glu Glu 
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635 








640 




645 


Glu Leu 


Met 


Asp 


Glu 


Asp 


Val 


Glu 


Glu Glu Asp Asp 


Ser 


Glu He 








650 








655 




660 


Ser Gly 


Tyr 


Ser 


Val 


Glu 


Asn 


Ala 


Phe Phe Asp Glu 


Lys 


Glu Asp 








665 








670 




675 


Thr Cys 


Ala 


Ala 


Val 


Gly 


Glu 


He 


Ser Val Asn Thr 


Ser 


Val Ala 






680 








685 




690 


Phe Leu 


Pro 


Tyr 


Met 


Glu 


Ser 


Val 


Phe Glu Glu Val 


Phe 


Lys Leu 








695 








700 




705 


Leu Glu 


Cys 


Pro 


His 


Leu 


Asn 


Val 


Arg Lys Ala Ala 


His 


Glu Ala 






710 








715 




720 


Leu Gly 


Gin 


Phe 


Cys 


Cys 


Ala 


Leu 


His Lys Ala Cys 


Gin 


Ser Cys 






725 








730 




735 


Pro Ser 


Glu 


Pro 


Asn 


Thr 


Ala 


Ala 


Leu Gin Ala Ala 


Leu 


Ala Arg 








740 








745 




750 


Val Val 


Pro 


Ser 


Tyr 


Met 


Gin 


Ala 


Val Asn Arg Glu 


Arg 


Glu Arg 








755 








760 




765 


Gin Val 


Val 


Met 


Ala 


Val 


Leu 


Glu 


Ala Leu Thr Gly 


Val 


Leu Arg 








770 








775 




780 


Ser Cys 


Gly 


Thr 


Leu 


Thr 


Leu 


Lys 


Pro Pro Gly Arg 


Leu 


Ala Glu 








785 








790 




795 


Leu Cys 


Gly 


Val 


Leu 


Lys 


Ala 


Val 


Leu Gin Arg Lys 


Thr 


Ala Cys 








800 








805 




810 


Gin Asp 


Thr 


Asp 


Glu 


Glu 


Glu 


Glu 


Glu Glu Asp Asp 


Asp 


Gin Ala 








815 








820 




825 


Glu Tyr 


Asp 


Ala 


Met 


Leu 


Leu 


Glu 


His Ala Gly Glu 


Ala 


He Pro 




830 








835 




840 


Ala Leu 


Ala 


Ala 


Ala 


Ala 


Gly 


Gly 


Asp Ser Phe Ala 


Pro 


Phe Phe 








845 








850 




855 


Ala Gly 


Phe 


Leu 


Pro 


Leu 


Leu 


Val 


Cys Lys Thr Lys 


Gin Gly Cys 






860 








865 




870 


Thr Val 


Ala 


Glu 


Lys 


Ser 


Phe 


Ala 


Val Gly Thr Leu 


Ala 


Glu Thr 








875 








880 




885 


He Gin 


Gly 


Leu 


Gly 


Ala 


Ala 


Ser 


Ala Gin Phe Val 


Ser 


Arg Leu 






890 








895 




900 


Leu Pro 


Val 


Leu 


Leu 


Ser 


Thr 


Ala 


Gin Glu Ala Asp 


Pro 


Glu Val 








905 








910 




915 


Arg Ser 


Asn 


Ala 


He 


Phe 


Gly 


Met 


Gly Val Leu Ala 


Glu 


His Gly 








920 








925 




930 


Gly His 


Pro 


Ala 


Gin 


Glu 


His 


Phe 


Pro Lys Leu Leu 


Gly Leu Leu 






935 








940 




945 


Phe Pro 


Leu 


Leu 


Ala 


Arg 


Glu 


Arg 


His Asp Arg Val 


Arg Asp Asn 








950 








955 




960 


He Cys 


Gly 


Ala 


Leu 


Ala 


Arg 


Leu 


Leu Met Ala Ser 


Pro 


Thr Arg 






965 








970 




975 


Lys Pro 


Glu 


Pro 


Gin 


Val 


Leu 


Ala 


Ala Leu Leu His 


Ala 


Leu Pro 






980 








985 




990 


Leu Lys 


Glu 


Asp 


Leu 


Glu 


Glu 


Trp 


Val Thr He Gly 


Arg Leu Phe 






995 








1000 




1005 


Ser Phe 


Leu 


Tyr 


Gin 


Ser 


Ser 


Pro 


Asp Gin Val He 


Asp Val Ala 






1010 








1015 




1020 


Pro Glu 


Leu 


Leu Arg 


He 


Cys 


Ser 


Leu He Leu Ala 


Asp Asn Lys 






1025 








1030 




1035 


He Pro 


Pro 


Asp Thr 


Lys 


Ala 


Ala 


Leu Leu Leu Leu 


Leu 


Thr Phe 






1040 








1045 




1050 


Leu Ala 


Lys 


Gin 


His 


Thr 


Asp 


Ser 


Phe Gin Ala Ala 


Leu 


Gly Ser 




1055 








1060 




1065 


Leu Pro 


Val 


Asp 


Lys 


Ala 


Gin 


Glu 


Leu Gin Ala Val 


Leu Gly Leu 






1070 








1075 




1080 



Ser 

<210> 3 
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<211> 1172 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Incyte ID No: 



<400> 3 








Met 


Tyr 


Val 


Arg 


val 


1 










HIS 


Leu 


Met 


Thr 


Lys 
zU 


Ti- 
ne 


Ser 


Val 


His 


Gly 

35 


Leu 


Lys 


Gin 


Val 


Phe 

50 


Tnr 


Ol mm 

\3iy 


Ala 


Trp 


He 


Arg 


HIS 


Val 


Gly 


Asp 
80 


Gly 


Lys 


He 


Cys 


Thr 
95 


Asn 


Gin 


Glu 


Asp 


Leu 
110 


Thr 


Met 


Ser 


Asn 


Pro 
125 


His 


Ser 


His 


Phe 


He 
140 


Gly 


Ala 


Glu 


Val 


Lys 
155 


Leu 


Gin 


Lys 


He 


Asn 
170 


Ala 


Leu 


He 


Val 


Glu 
185 


Glu 


Tyr 


Leu 


Arg 


Asp 
200 


Gly 


Ser 


Gly 


Arg 


Ala 
215 


Ser 


Glu 


Glu 


Gly 


Gly 
230 


Leu 


Val 


Thr 


He 


Gin 
245 


Gin 


His 


Leu 


Phe 


He 

1 C f\ 

260 


Leu 


He 


Thr 


Val 


Phe 
275 


Asp 


Leu 


Ala 


He 


Leu 
290 


Ala 


Pro 


Asp 


Gin 


Leu 
305 


He 


Ala 


Arg 


Ser 


Gin 
320 


Gly 


Ser 


Leu 


Glu 


Gin 
335 


Val 


Asp 


Phe 


Val 


Lys 
350 


Arg 


Phe 


Leu 


Thr 


He 
365 


His 


Gly 


Pro 


Ser 


Asn 
380 


Lys 


Gly 


Asn 


Leu 


Pro 
395 



3552033CD1 



Ser 


pne 


Asp 


Tnr 


Lys 
10 


GIU 


Trp 


Gin 


Leu 


Glu 
25 


Gly 


Leu 


Gin 


Asn " 


Phe 
40 


Gly 


Lys 


Gly 


Leu 


He 
55 


Phe 


Thr 


Gly 


Gly 


val 
70 


Ala 


Leu 


Lys 


Asp 


HIS 

85 


He 


Gly 


He 


T.1 . 
Ala 


Pro 
100 


He 


Gly 


Arg 


Asp 


Val 
115 


Met 


Ser 


Lys 


Leu 


Thr 
130 


Leu 


Ala 


Asp 


Asn 


Gly 
145 


Leu 


Arg 


Arg 


Gin 


Leu 
160 


Thr 


Arg 


He 


Gly 


Gin 
175 


Gly 


Gly 


Pro 


Asn 


Val 
190 


Thr 


Pro 


Pro 


Val 


Pro 
205 


Ser 


Asp 


lie 


Leu 


Ala 

220 


Leu 


He 


Asn 


Glu 


Ser 
235 


i»ys 


Thr 


Pne 


Thr 


Tyr 
250 


He 


Leu 


Met 


Glu 


Cys 
265 


Arg 


Met 


Gly 


Ser 


Glu 
280 


Thr 


Ala 


Leu 


Leu 


Lys 
295 


Ser 


Leu 


Ala 


Leu 


Ala 
310 


He 


Phe 


He 


Tyr 


Gly 
325 


Ala 


Met 


Leu 


Asp 


Ala 
340 


Leu 


Leu 


He 


Glu 


Asn 
355 


Ser 


Arg 


Leu 


Glu 


Glu 
370 


Thr 


Leu 


Tyr 


His 


Leu 
385 


Pro 


Asp 


Tyr 


Arg 


He 
400 



Pro 


Asp 


Leu 


Leu 


Leu 
15 


Leu 


Pro 


Lys 


Leu 


Leu 
30 


Glu 


Leu 


Gin 


Pro 


Lys 
45 


Lys 


Ala 


Ala 


Met 


Thr 








60 


Asn 


Thr 


Gly 


Val 


75 


Ala 


Ser 


Lys 


Ser 


Arg 
90 


Trp 


Gly 


He 


Val 


Glu 
105 


Val 


Arg 


Pro 


Tyr 


Gin 
120 


Val 


Leu 


Asn 


Ser 


Met 
135 


Thr 


Thr 


Gly 


Lys 


Tyr 
150 


Glu 


Lys 


His 


He 


Ser 
165 


Gly 


Val 


Pro 


Val 


Val 
180 


He 


Ser 


He 


Val 


Leu 
195 


Val 


Val 


Val 


Cys 


Asp 
210 


Phe 


Gly 


HIS 


Lys 


Tyr 
225 


Leu 


Arg 


Asp 


Gin 


Leu 
240 


Thr 


Arg 


Thr 


Gin 


Ala 
255 


Met 


Lys 


Lys 


Lys 


Glu 
270 


Gly 


His 


Gin 


Asp 


He 
285 


Gly 


Ala 


Asn 


Ala 


Ser 
300 


Trp 


Asn 


Arg 


Val 


Asp 
315 


Gin 


Gin 


Trp 


Pro 


Val 
330 


Leu 


Val 


Leu 


Asp 


Arg 
345 


Gly 


Val 


Ser 


Met 


His 
360 


Leu 


Tyr 


Asn 


Thr 


Arg 
375 


Val 


Arg 


Asp 


Val 


Lys 
390 


Ser 


Leu 


He 


Asp 


He 
405 
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Gly Leu Val lie Glu Tyr Leu Met Gly Gly Ala Tyr Arg Cys Asn 

410 415 420 

Tyr Thr Arg Lys Arg Phe Arg Thr Leu Tyr His Asn Leu Phe Gly 

425 430 435 

Pro Lys Arg Pro Lys Ala Leu Lys Leu Leu Gly Met Glu Asp Asp 

440 445 450 

He Pro Leu Arg Arg Gly Arg Lys Thr Thr Lys Lys Arg Glu Glu 

455 460 465 

Glu Val Asp He Asp Leu Asp Asp Pro Glu He Asn His Phe Pro 

470 475 480 

Phe Pro Phe His Glu Leu Met Val Trp Ala Val Leu Met Lys Arg 

485 490 495 

Gin Lys Met Ala Leu Phe Phe Trp Gin His Gly Glu Glu Ala Met 

500 505 510 

Ala Lys Ala Leu Val Ala Cys Lys Leu Cys Lys Ala Met Ala His 

515 520 525 

Glu Ala Ser Glu Asn Asp Met Val Asp Asp He Ser Gin Glu Leu 

530 535 540 

Asn His Asn Ser Arg Asp Phe Gly Gin Leu Ala Val Glu Leu Leu 

545 550 555 

Asp Gin Ser Tyr Lys Gin Asp Glu Gin Leu Ala Met Lys Leu Leu 

560 565 570 

Thr Tyr Glu Leu Lys Asn Trp Ser Asn Ala Thr Cys Leu Gin Leu 

575 580 585 

Ala Val Ala Ala Lys His Arg Asp Phe He Ala His Thr Cys Ser 

590 595 600 

Gin Met Leu Leu Thr Asp Met Trp Met Gly Arg Leu Arg Met Arg 

605 610 615 

Lys Asn Ser Gly Leu Lys Val He Leu Gly He Leu Leu Pro Pro 

620 625 630 

Ser He Leu Ser Leu Glu Phe Lys Asn Lys Asp Asp Met Pro Tyr 

635 640 645 

Met Ser Gin Ala Gin Glu He His Leu Gin Glu Lys Glu Ala Glu 

650 655 660 

Glu Pro Glu Lys Pro Thr Lys Glu Lys Glu Glu Glu Asp Met Glu 

665 670 675 

Leu He Ala Met Leu Gly Arg Asn Asn Gly Glu Ser Ser Arg Lys 

680 685 690 

Lys Asp Glu Glu Glu Val Gin Ser Glu His Arg Leu He Pro Leu 

695 700 705 

Gly Arg Lys He Tyr Glu Phe Tyr Asn Ala Pro He Val Lys Phe 

710 715 720 

Trp Phe Tyr Thr Leu Ala Tyr He Gly Tyr Leu Met Leu Phe Asn 

725 730 735 

Tyr He Val Leu Val Lys Met Glu Arg Trp Pro Pro Thr Gin Glu 

740 745 750 

Trp He Val He Ser Tyr He Phe Thr Leu Gly He Glu Lys Met 

755 760 765 

Arg Glu He Leu Met Ser Glu Pro Gly Lys Leu Leu Gin Lys Val 

770 775 780 

Lys Val Trp Leu Gin Glu His Trp Asn Val Thr Asp Leu He Ala 

785 790 795 

He Leu Leu Phe Ser Val Gly Met He Leu Arg Leu Gin Asp Gin 

800 805 810 

Pro Phe Arg Ser Asp Gly Arg Val He Tyr Cys Val Asn He He 

815 820 825 

Tyr Trp Tyr lie Arg Leu Leu Asp lie Phe Gly Val Asn Lys Tyr 

830 835 840 

Leu Gly Pro Tyr Val Met Met lie Gly Lys Met Met lie Asp Met 

845 850 855 

Met Tyr Phe Val lie lie Met Leu Val Val Leu Met Ser Phe Gly 

860 865 870 

Val Ala Arg Gin Ala lie Leu Phe Pro Asn Glu Glu Pro Ser Trp 
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875 



Lys 


Leu 


Ala 


Lys Asn 








oyu 


Gly 


G1U 


vai 


rne Aia 










Glu 


Thr 


Arg 


Glu Asp 








ft O A 

920 


Thr 


Gly 


Ala 


Trp He 








ft o c 

935 


Val 


Ala 


Asn 


He Leu 








950 


Asn 


Thr 


Phe 


Phe Glu 








965 


Phe 


Gin 


Arg 


Tyr Gin 








980 


Leu 


Pro 


Pro 


Pro Leu 








995 


Gin 


His 


Leu 


Cys Cys 








1010 


Glu 


Arg 


Asp 


Tyr Gly 








1025 


Lys 


Lys 


Val 


His Asp 








1040 


Arg 


Glu 


Lys 


Asp Asp 








1055 


Arg 


Val 


Thr 


Ser Glu 








1070 


Glu 


Val 


Asn 


Glu Arg 








1085 


Val 


Asp 


He 


Arg Leu 








1100 


Ala 


Thr 


Ala 


Leu Glu 








1115 


Asn 


Lys 


He 


Arg Ser 








1130 


Tyr 


He 


Val 


Arg Gin 








1145 


Phe 


Lys 


Leu 


Gin Glu 








1160 


Ser 


Val 











880 


He 


Phe Tyr Met Pro 






895 


Asp 


Gin He Asp Pro 






910 


Gly 


Lys He 


He Gin 






925 


Val 


Pro Ala 


lie Met 






940 


Leu 


Val Asn 


Leu Leu 






955 


Val 


Lys Ser 


He Ser 






970 


Leu 


He Met 


Thr Phe 






985 


He 


He Phe 


Ser His 






1000 


Arg 


Trp Arg 


Lys His 






1015 


Leu 


Lys Leu 


Phe He 






1030 


Phe 


Glu Glu 


Gin Cys 






1045 


Arg 


Phe Asn 


Ser Ser 






1060 


Arg 


Val Glu 


Asn Met 






1075 


Glu 


His Ser 


Met Lys 






1090 


Ala 


Gin Leu Glu Asp 






1105 


Arg 


Leu Thr Gly Leu 






1120 


Arg 


Thr Ser 


Ser Asp 






1135 


Ser 


Ser Phe 


Asn Ser 






1150 


Ser 


He Asp Pro Ala 






1165 







885 


Tyr 


Trp Met 


lie Tyr 






900 


Pro Cys Gly 


Gin Asn 






915 


Leu 


Pro Pro 


Cys Lys 






930 


Ala 


Cys Tyr 


Leu Leu 






945 


lie 


Ala Val 


Phe Asn 






960 


Asn 


Gin Val 


Trp Lys 






975 


His 


Glu Arg 


Pro Val 






990 


Met 


Thr Met 


lie Phe 






1005 


Glu 


Ser Asp 


Pro Asp 






1020 


Thr 


Asp Asp 


Glu Leu 






1035 


He 


Glu Glu 


Tyr Phe 






1050 


Asn 


Asp Glu 


Arg lie 






1065 


Ser 


Met Arg 


Leu Glu 






1080 


Ala 


Ser Leu 


Gin Thr 






1095 


Leu 


lie Gly 


Arg Met 






1110 


Glu Arg Ala 


Glu Ser 






1125 


Cys Thr Asp 


Ala Ala 






1140 


Gin Glu Gly 


Asn Thr 






1155 


Glu 


His Pro 


Leu Tyr 






1170 



<210> 4 

<211> 742 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4778139CD1 

<400> 4 



Met 


Glu 


lie Ala 


Gly Glu 


Lys 


Lys Tyr 


Gin Tyr Tyr Ser 


Met Pro 


1 






5 






10 


15 


Ser 


Ser 


Leu Leu 


Arg Ser 


Leu 


Ala Gly 


Gly Arg His Pro 


Pro Arg 








20 






25 


30 


Val 


Gin 


Arg Gly 


Ala Leu 


Leu 


Glu He 


Thr Asn Ser Lys 


Arg Glu 








35 






40 


45 


Ala 


Thr 


Asn Val 


Arg Asn 


Asp 


Gin Glu 


Arg Gin Glu Thr 


Gin Ser 








50 






55 


60 


Ser 


lie 


Val Val 


Ser Gly 


Val 


Ser Pro 


Asn Arg Gin Ala 


His Ser 








65 






70 


75 


Lys 


Tyr 


Gly Gin 


Phe Leu 


Leu 


Val Pro 


Ser Asn Leu Lys 


Arg Val 
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80 85 90 

Pro Phe Ser Ala Glu Thr Val Leu Pro Leu Ser Arg Pro Ser Val 
95 100 105 

Pro Asp Val Leu Ala Thr Glu Gin Asp lie Gin Thr Glu Val Leu 
110 115 120 

Val His Leu Thr Gly Gin Thr Pro Val Val Ser Asp Trp Ala Ser 
125 130 135 

Val Asp Glu Pro Lys Glu Lys His Glu Pro He Ala His Leu Leu 
140 145 150 

Asp Gly Gin Asp Lys Ala Glu Gin Val Leu Pro Thr Leu Ser Cys 
155 160 165 

Thr Pro Glu Pro Met Thr Met Ser Ser Pro Leu Ser Gin Ala Lys 
170 175 180 

He Met Gin Thr Gly Gly Gly Tyr Val Asn Trp Ala Phe Ser Glu 
185 190 195 

Gly Asp Glu Thr Gly Val Phe Ser He Lys Lys Lys Trp Gin Thr 
200 205 210 

Cys Leu Pro Ser Thr Cys Asp Ser Asp Ser Ser Arg Ser Glu Gin 
215 220 225 

His Gin Lys Gin Ala Gin Asp Ser Ser Leu Ser Asp Asn Ser Thr 
230 235 240 

Arg Ser Ala Gin Ser Ser Glu Cys Ser Glu Val Gly Pro Trp Leu 
245 250 255 

Gin Pro Asn Thr Ser Phe Trp He Asn Pro Leu Arg Arg Tyr Arg 
260 265 270 

Pro Phe Ala Arg Ser His Ser Phe Arg Phe His Lys Glu Glu Lys 
275 280 285 

Leu Met Lys He Cys Lys He Lys Asn Leu Ser Gly Ser Ser Glu 
290 295 300 

He Gly Gin Gly Ala Trp Val Lys Ala Lys Met Leu Thr Lys Asp 
305 310 315 

Arg Arg Leu Ser Lys Lys Lys Lys Asn Thr Gin Gly Leu Gin Val 
320 325 330 

Pro lie He Thr Val Asn Ala Cys Ser Gin Ser Asp Gin Leu Asn 
335 340 345 

Pro Glu Pro Gly Glu Asn Ser He Ser Glu Glu Glu Tyr Ser Lys 
350 355 ^ 360 

Asn Trp Phe Thr Val Ser Lys Phe Ser His Thr Gly Val Glu Pro 
365 370 375 

Tyr He His Gin Lys Met Lys Thr Lys Glu He Gly Gin Cys Ala 
380 385 390 

He Gin He Ser Asp Tyr Leu Lys Gin Ser Gin Glu Ser Ala Gin 
395 400 405 

Asp Leu Ser Lys Asn Ser Leu Trp Asn Ser Arg Ser Thr Asn Leu 
410 415 420 

Asn Arg Asn Ser Leu Ser Ser Leu He Ser Glu He Ser Ala Ser 
425 430 435 

Leu Lys Ser Pro Gin Glu Pro His His His Tyr Ser Pro Ser Leu 
440 445 450 

Leu Phe Ala Ala Gly Glu Glu He Thr Val Tyr Arg Leu Glu Glu 
455 460 465 

Ser Ser Pro Leu Asn Leu Asp Lys Ser Met Ser Ser Trp Ser Gin 
470 475 480 

Arg Gly Arg Ala Ala Met He Gin Val Leu Ser Arg Glu Glu Met 
485 490 495 

Asp Gly Gly Leu Arg Lys Ala Met Arg Val Val Ser Thr Trp Ser 
500 505 510 

Glu Asp Asp He Leu Lys Pro Gly Gin Val Phe He Val Lys Ser 
515 520 525 

Phe Leu Pro Glu Val Val Arg Thr Trp His Lys He Phe Gin Glu 
530 535 540 

Ser Thr Val Leu His Leu Cys Leu Arg Glu He Gin Gin Gin Arg 
545 550 555 
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Ala 


Ala 


Gin Lys 


Leu 


He 


Tyr 


Thr 


Phe 


Asn 


Gin Val Lys Pro Gin 








560 










565 


570 


Thr 


lie 


Pro Tyr 


Thr 


Pro 


Arg 


Phe 


Leu 


Glu 


Val Phe Leu He Tyr 








575 










580 


585 


Cys 


His 


Ser Ala 


Asn 


Gin 


Trp 


Leu 


Thr 


He 


Glu Lys Tyr Met Thr 








590 










595 


600 


Gly 


Glu 


Phe Arg 


Lys 


Tyr 


Asn 


Asn 


Asn 


Asn 


Gly Asp Glu He Thr 








605 










610 


615 


Pro 


Thr 


Asn Thr 


Leu 


Glu 


Glu 


Leu 


Met 


Leu 


Ala Phe Ser His Trp 








620 










625 


630 


Thr 


Tyr 


Glu Tyr 


Thr 


Arg 


Gly 


Glu 


Leu 


Leu 


Val Leu Asp Leu Gin 








635 










640 


645 


Gly 


Val 


Gly Glu 


Asn 


Leu 


Thr 


Asp 


Pro 


Ser 


Val He Lys Pro Glu 








650 










655 


660 


Val 


Lys 


Gin Ser 


Arg 


Gly 


Met 


Val 


Phe Gly 


Pro Ala Asn Leu Gly 








665 










670 


675 


Glu 


Asp 


Ala He 


Arg 


Asn 


Phe 


He 


Ala 


Lys 


His His Trp Asn Ser 








680 










685 


690 


Cys 


Cys 


Arg Lys 


Leu 


Lys 


Leu 


Pro 


Asp 


Leu 


Lys Arg Asn Asp Tyr 








695 










700 


705 


Ser 


Pro 


Glu Arg 


He 


Asn 


Ser 


Thr 


Phe Gly 


Leu Glu He Lys He 








710 










715 


720 


Glu 


Ser 


Ala Glu 


Glu 


Pro 


Pro 


Ala Arg Glu 


Thr Gly Arg Asn Ser 








725 










730 


735 


Pro 


Glu 


Asp Asp 


Met 


Gin 


Leu 
















740 















<210> 5 
<211> 577 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 4787433CD1 



<400> 5 
















Met 


Gly 


Ser 


Arg 


His 


Phe 


Glu 


Gly lie Tyr 


Asp His Val Gly His 


1 








5 






10 


15 


Phe 


Gly 


Arg 


Phe 


Gin 


Arg 


Val 


Leu Tyr Phe 


lie Cys Ala Phe Gin 










20 






25 


30 


Asn 


lie 


Ser 


Cys 


Gly 


lie 


His 


Tyr Leu Ala 


Ser Val Phe Met Gly 










35 






40 


45 


Val 


Thr 


Pro 


His 


His 


Val 


Cys 


Arg Pro Pro 


Gly Asn Val Ser Gin 










50 






55 


60 


Val 


Val 


Phe 


His 


Asn 


His 


Ser 


Asn Trp Ser 


Leu Glu Asp Thr Gly 










65 






70 


75 


Ala 


Leu 


Leu 


Ser 


Ser 


Gly 


Gin 


Lys Asp Tyr 


Val Thr Val Gin Leu 










80 






85 


90 


Gin 


Asn 


Gly 


Glu 


lie 


Trp 


Glu 


Leu Ser Arg 


Cys Ser Arg Asn Lys 










95 






100 


105 


Arg 


Glu 


Asn 


Thr 


Ser 


Ser 


Leu 


Gly Tyr Glu 


Tyr Thr Gly Ser Lys 










110 






115 


120 


Lys 


Glu 


Phe 


Pro 


Cys 


Val 


Asp 


Gly Tyr lie 


Tyr Asp Gin Asn Thr 










125 






130 


135 


Trp 


Lys 


Ser 


Thr 


Ala 


Val 


Thr 


Gin Trp Asn 


Leu Val Cys Asp Arg 










140 






145 


150 


Lys 


Trp 


Leu 


Ala 


Met 


Leu 


lie 


Gin Pro Leu 


Phe Met Phe Gly Val 










155 






160 


165 


Leu 


Leu 


Gly 


Ser 


Val 


Thr 


Phe 


Gly Tyr Phe 


Ser Asp Arg Leu Gly 










170 






175 


180 


Arg 


Arg 


Val 


Val 


Leu 


Trp 


Ala 


Thr Ser Ser 


Ser Met Phe Leu Phe 










185 






190 


195 
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Gly lie 


Ala Ala Ala 


Phe 


Ala 


Val 




200 








Ala Arg 


Phe Phe Leu 


Ala 


Met 


Val 




215 








Gly Phe 


Val Tvr Val 


Met 


Glu 


Phe 




230 








Trp Ala 


Ser Val His 


Leu 


His 


Ser 




245 








Leu Val 

UC \A VOX 


Ala Leu Thr 

oXH UCU 1 111 


Gly Tyr 


Leu 




260 








Gin Mer 

V7 X 1 1 HC L> 


Tie Leu Spr 

llw UCU OC1 


Thr 


vai 


Thr 




275 








Trp Val 


Leu Pro Glu 


Thr 


Pro 


Phe 




290 








Tvr Gl 11 


Glu Ala Gin 

VJ1U «*1 C* Oil! 


Lys 


He 


Val 




ins 








Arrr lla 


Cay Coy f*\ra 
Ocl OCX Lyo 


Lys 


Leu 


Oct 




J z u 








Gl n gi \/ 


D r « \f fl 1 Cor 
riO Vol Jsci 


Asn 


Ser 


Pit* 




US 








Leu sex* 


TH^y T oil dVi q 
I/I beU rTlc 


Tyr 


Asn 


Trn 
lip 












Th r* \7» 1 
liii Val 


J. xp Leu lie 


Trp 


Phe 


Thr 
i in 




3fi5 








Pho> Cay 


T.on Jen Coy* 
JjcU noil Del 


Val 


Asn 


Leu 




3 RO 








T.eil Phe 


ueu ueu uiy 


Val 


Val 


Glu 




395 








Pve Tl o 


si a Mot~ &en 
Axct ncL naky 


Lys 


Val 


Glv 




*± X V 








Cor T ,01 1 


p"H o Pvc Cor 
{rile \^yo Ocl 


Ala 


Leu 


Ala 




495 








tXO VjXII 


T ,i/c Ui e T*Y/"r" 

iiys nib iyi 


He 


Leu 


Glv 
oiy 












T \re PV^ c> 

uys trie 


si a T 1 o Gl \/ 
nla lie uiy 


Ala 


Ala 


Phe 
t ne 




f403 








Ala pi t« 


bcu iyr xrxo 


Thr 


He 


Val 
vox 




fi / U 








v^iy oer 


Met vai uys 


Arg 


Leu 


Al a 












Val ton 
Vol HSp 


T i O £± y C a y 

Lieu oer oer 


He 


Trp 


Tie 
lie 




500 








Gl v Thr 

«X Jf A 1 IX 


Mot - Al a T.eil 
lie u mo ueu 


Leu 


Ser 


Glv 

vjxy 




51 5 
jIj 








Glu Thr 


Leu Gly Lys 


Arg 


Leu 


Ala 




530 








Lys Leu 


Glu Ser Glu 


Asn 


Glu 


Ser 




545 








Thr Thr 


Asn Asn Ser 


Gly Leu 


Glu 




560 








Arg Asp 


Ser Gly Leu 


Gly Glu 






575 









<210> 6 
<211> 462 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7483598CD1 

<400> 6 



ASD 


Tvr 


Tyr Thr 


Phe Met Ala 




205 




210 


Ala 


Ser 


Gly Tyr 


Leu Val Val 




220 




225 


He 


Gly 


Met Lys 


Ser Arg Thr 




235 




240 


Phe 


Phe 


Ala Val 


Gly Thr Leu 




250 




255 


Val 


Arg 


Thr Trp 


Trp Leu Tyr 




265 




270 


Val 


Pro 


Phe He 


Leu Cys Cys 




280 




285 


Tro 


Leu 


Leu Ser 


Glu Gly Arg 




295 




300 


Asp 


He 


Met Ala 


Lvs Trr> Asn 




310 

J 1 u 




315 


Glu 


Leu 


Leu Ser 


Leu Asd Leu 




395 

,? x, J 




330 


Thr 
i iix 


Gl ii 
ui u 


Va 1 Gin 
veil uiu 


T.vfl Hi q Ann 
uyo nx 9 noil 




340 




345 
■j *± j 


Ser 


Tl e 
lie 


T h T* T .\f Q 

1 1 ix uy e» 


A T"rr Th r T .ei i 
niy i in ueu 




355 
J 3 3 




360 


vj i _y 


Ser 

Ocl 


T,pi i Gl v 
ueu v?xy 


Phe Tvr Ser 




370 




375 


Glv 
oxy 


Gly 


Acn Gl 11 
noi i oi u 


Tvr Leu Asn 




385 




390 


He 


Pro 


Ala Tyr 


Thr Phe Val 




400 




405 


Aro 
niy 


Arg 


Thr Val 


Leu Ala Tyr 




415 




420 


Gvq 
v^y o 


Glv 


Val Val 

VOX V ctx 


Met Val He 

l ie W» v a X X X 




430 




435 


Val 
vai 


Val 
v ctx 


Thr Al a 

11 IX nlQ 


Met Val Glv 
iu e l. vox vjxy 




445 




450 


Glv 
uiy 


Leu 


x x e i y x 


T,pn Tvr Thr 
ueu iyi Aiii- 




4fin 




465 


Arg 


Ser 
out 


T .on Ala 
ueu nla 


Val Glv Ser 
vox vsxy oex 




475 




480 


Ser 


lie 


Leu Ala 


Pro Phe Ser 




4Q0 




495 


Phe 


Tie 
lie 


Pyn Gl n 


Leu Phe Val 
ueu it lie vox 




505 




510 


Val 


Leu 


Thr Leu 


Lys Leu Pro 




520 




525 


Thr 


Thr 


Trp Glu 


Glu Ala Ala 




535 




540 


Lys 


Ser 


Ser Lys 


Leu Leu Leu 




550 




555 


Lys 


Thr 


Glu Ala 


He Thr Pro 




565 




570 
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Met Gly Tyr Gin Arg Gin Glu Pro Val He Pro Pro Gin Arg Asp 
15 10 15 

Leu Asp Asp Arg Glu Thr Leu Val Ser Glu His Glu Tyr Lys Glu 
20 25 30 

Lys Thr Cys Gin Ser Ala Ala Leu Phe Asn Val Val Asn Ser He 
35 40 45 

He Gly Ser Gly He He Gly Leu Pro Tyr Ser Met Lys Gin Ala 
50 55 60 

Gly Phe Pro Leu Gly He Leu Leu Leu Phe Trp Val Ser Tyr Val 
65 70 75 

Thr Asp Phe Ser Leu Val Leu Leu He Lys Gly Gly Ala Leu Ser 
80 85 90 

Gly Thr Asp Thr Tyr Gin Ser Leu Val Asn Lys Thr Phe Gly Phe 
95 100 105 

Pro Gly Tyr Leu Leu Leu Ser Val Leu Gin Phe Leu Tyr Pro Phe 

110 115 120 

He Ala Met He Ser Tyr Asn He He Ala Gly Asp Thr Leu Ser 

125 130 135 

Lys Val Phe Gin Arg He Pro Gly Val Asp Pro Glu Asn Val Phe 

140 145 150 

He Gly Arg His Phe He He Gly Leu Ser Thr Val Thr Phe Thr 

155 160 165 

Leu Pro Leu Ser Leu Tyr Arg Asn He Ala Lys Leu Gly Lys Val 

170 175 180 

Ser Leu He Ser Thr Gly Leu Thr Thr Leu He Leu Gly He Val 

185 190 195 

Met Ala Arg Ala He Ser Leu Gly Pro His He Pro Lys Thr Glu 

200 205 210 

Asp Ala Trp Val Phe Ala Lys Pro Asn Ala He Gin Ala Val Gly 

215 220 225 

Val Met Ser Phe Ala Phe He Cys His His Asn Ser Phe Leu Val 

230 235 240 

Tyr Ser Ser Leu Glu Glu Pro Thr Val Ala Lys Trp Ser Arg Leu 

245 250 255 

He His Met Ser He Val lie Ser Val Phe He Cys lie Phe Phe 

260 265 270 

Ala Thr Cys Gly Tyr Leu Thr Phe Thr Gly Phe Thr Gin Gly Asp 

275 280 285 

Leu Phe Glu Asn Tyr Cys Arg Asn Asp Asp Leu Val Thr Phe Gly 

290 295 300 

Arg Phe Cys Tyr Gly Val Thr Val lie Leu Thr Tyr Pro Met Glu 

305 310 315 

Cys Phe Val Thr Arg Glu Val lie Ala Asn Val Phe Phe Gly Gly 

320 325 330 

Asn Leu Ser Ser Val Phe His He Val Val Thr Val Met Val He 

335 340 345 

Thr Val Ala Thr Leu Val Ser Leu Leu He Asp Cys Leu Gly lie 

350 355 360 

Val Leu Glu Leu Asn Gly Val Leu Cys Ala Thr Pro Leu He Phe 

365 370 375 

lie lie Pro Ser Ala Cys Tyr Leu Lys Leu Ser Glu Glu Pro Arg 

380 385 390 

Thr His Ser Asp Lys lie Met Ser Cys Val Met Leu Pro lie Gly 

395 400 405 

Ala Val Val Met Val Phe Gly Phe Val Met Ala lie Thr Asn Thr 

410 415 420 

Gin Asp Cys Thr His Gly Gin Glu Met Phe Tyr Cys Phe Pro Asp 

425 430 435 

Asn Phe Ser Leu Thr Asn Thr Ser Glu Ser His Val Gin Gin Thr 

440 445 450 

Thr Gin Leu Ser Thr Leu Asn lie Ser lie Phe Gin 

455 460 
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<210> 7 
<211> 1018 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7484823CD1 

<400> 7 



Met Gly 


Glu 


Gly 


Asp Ser Gly 


Leu 


Arg 


Ser 


Pro 


Arg 


Gly 


Pro 


Gin 


1 






5 






10 










15 


Glu Gly 


Gly 


Lys 


Cys Ser Arg 


Glu 


Lys 


Gin 


Lys 


Arg 


Asn 


Met 


Glu 








20 






25 










30 


Glu Leu 


Lys 


Lys 


Glu Val Val 


Met 


Asp 


Asp 


His 


Lys 


Leu 


Thr 


Leu 








35 






40 










45 


Glu Glu 


Leu 


Ser 


Thr Lys Tyr 


Ser 


Val 


Asp 


Leu 


Thr 


Lys 


Gly His 








50 






55 










60 


Ser His 


Gin 


Arg 


Ala Lys Glu 


He 


Leu 


Thr 


Arg 


Gly 


Gly 


Pro 


Asn 






65 






70 










75 


Thr Val 


Thr 


Pro 


Pro Pro Thr 


Thr 


Pro 


Glu 


Trp 


Val 


Lys 


Phe 


Cys 








80 






85 










90 


Lys Gin 


Leu 


Phe 


Gly Gly Phe 


Ser 


Leu 


Leu 


Leu 


Trp 


Thr 


Gly Ala 








95 






100 










105 


lie Leu 


Cys 


Phe 


Val Ala Tyr 


Ser 


He 


Gin 


He 


Tyr 


Phe 


Asn 


Glu 






110 






115 










120 


Glu Pro 


Thr 


Lys 


Asp Asn Leu 


Tyr 


Leu 


Ser 


He 


Val 


Leu 


Ser 


Val 






125 






130 










135 


Val Val 


He 


Val 


Thr Gly Cys 


Phe 


Ser 


Tyr 


Tyr 


Gin 


Glu 


Ala 


Lys 








140 






145 










150 


Ser Ser 


Lys 


He 


Met Glu Ser 


Phe 


Lys 


Asn 


Met 


Val 


Pro 


Gin 


Gin 






155 






160 










165 


Ala Leu 


Val 


He 


Arg Gly Gly 


Glu 


Lys 


Met 


Gin 


He 


Asn 


Val 


Gin 








170 






175 










180 


Glu Val 


Val 


Leu 


Gly Asp Leu 


Val 


Glu 


He 


Lys 


Gly 


Gly 


Asp Arg 








185 






190 










195 


Val Pro 


Ala 


Asp 


Leu Arg Leu 


He 


Ser 


Ala 


Gin 


Gly 


Cys 


Lys 


Val 






200 






205 










210 


Asp Asn 


Ser 


Ser 


Leu Thr Gly 


Glu 


Ser 


Glu 


Pro 


Gin 


Ser 


Arg 


Ser 






215 






220 










225 


Pro Asp 


Phe 


Thr 


His Glu Asn 


Pro 


Leu 


Glu 


Thr 


Arg 


Asn 


He 


Cys 






230 






235 










240 


Phe Phe 


Ser 


Thr 


Asn Cys Val 


Glu 


Gly 


Thr 


Ala 


Arg 


Gly 


He 


Val 








245 






250 










255 


He Ala 


Thr 


Gly 


Asp Ser Thr 


Val 


Met 


Gly 


Arg 


He 


Ala 


Ser 


Leu 








260 






265 










270 


Thr Ser 


Gly 


Leu 


Ala Val Gly 


Gin 


Thr 


Pro 


He 


Ala 


Ala 


Glu 


lie 






275 






280 










285 


Glu His 


Phe 


He 


His Leu He 


Thr 


Val 


Val 


Ala 


Val 


Phe 


Leu Gly 








290 






295 










300 


Val Thr 


Phe 


Phe 


Ala Leu Ser 


Leu 


Leu 


Leu 


Gly 


Tyr 


Gly 


Trp 


Leu 








305 






310 










315 


Glu Ala 


He 


He 


Phe Leu He 


Gly 


He 


He 


Val 


Ala 


Asn 


Val 


Pro 








320 






325 










330 


Glu Gly 


Leu 


Leu 


Ala Thr Val 


Thr 


Val 


Cys 


Leu 


Thr 


Leu 


Thr 


Ala 






335 






340 










345 


Lys Arg 


Met 


Ala 


Arg Lys Asn 


Cys 


Leu 


Val 


Lys 


Asn 


Leu 


Glu 


Ala 








350 






355 










360 


Val Glu 


Thr 


Leu 


Gly Ser Thr 


Ser 


Thr 


He 


Cys 


Ser 


Asp 


Lys 


Thr 








365 






370 










375 


Gly Thr 


Leu 


Thr 


Gin Asn Arg 


Met 


Thr 


Val 


Ala 


His 


Met 


Trp 


Phe 






380 






385 










390 


Asp Met 


Thr 


Val 


Tyr Glu Ala 


Asp 


Thr 


Thr 


Glu 


Glu 


Gin 


Thr Gly 
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395 

Lys Thr Phe Thr Lys Ser Ser Asp 
410 

He Ala Gly Leu Cys Asn Arg Ala 
425 

He Leu Pro He Ala Lys Arg Ala 
440 

Ser Ala Leu Leu Lys Phe He Glu 
455 

Glu Met Arg Glu Lys Asn Pro Lys 
470 

Ser Thr Asn Lys Tyr Gin Met Ser 
485 

Ser Gin Thr His Val Leu Met Met 
500 

Leu Glu Phe Cys Ser Thr Phe Leu 
515 

Met Asn Asp Glu Met Lys Glu Ala 
530 

Leu Gly Gly Leu Gly Glu Arg Val 
545 

Leu Pro Ser Ser Phe Ser Lys Gly 
560 

He Asn Phe Pro Met Asp Asn Leu 
575 

Met He Asp Pro Pro Arg Ala Ala 
590 

Cys Arg Ser Ala Gly He Lys Val 
605 

Pro He Thr Ala Lys Ala He Ala 
620 

Glu Gly Thr Glu Thr Ala Glu Glu 
635 

Pro He Ser Lys Val Asp Ala Ser 
650 

His Gly Ala Glu Leu Lys Asp He 
665 

He Leu Gin Asn His Pro Glu He 
680 

Gin Gin Lys Leu He He Val Glu 
695 

Val Val Ala Val Thr Gly Asp Gly 
710 

Lys Lys Ala Asp He Gly lie Ala 
725 

Val Ser Lys Gin Ala Ala Asp Met 
740 

Ala Ser He Val Thr Gly Val Glu 
755 

Asn Leu Lys Lys Ser He Met Tyr 
770 

Glu He Thr Pro Phe Leu Met Phe 
785 

Pro Leu Gly Thr He Thr He Leu 
800 

Met Val Pro Ala He Ser Leu Ala 
815 

He Met Lys Arg Leu Pro Arg Asn 
830 

Asn His Arg Leu He Gly Met Ala 
845 

Gin Ala Leu Ala Gly Phe Phe Thr 
860 



400 405 
Thr Trp Phe Met Leu Ala Arg 

415 420 
Asp Phe Lys Ala Asn Gin Glu 

430 435 
Thr Thr Gly Asp Ala Ser Glu 

445 450 
Gin Ser Tyr Ser Ser Val Ala 

460 465 
Val Ala Glu Val Pro Phe Asn 

475 480 
lie His Leu Arg Glu Asp Ser 

490 495 
Lys Gly Ala Pro Glu Arg lie 

505 510 
Leu Asn Gly Gin Glu Tyr Ser 

520 525 
Phe Gin Asn Ala Tyr Leu Glu 

535 540 
Leu Gly Phe Cys Phe Leu Asn 

550 555 
Phe Pro Phe Asn Thr Asp Glu 

565 570 
Cys Phe Val Gly Leu lie Ser 

580 585 
Val Pro Asp Ala Val Ser Lys 

595 600 
He Met Val Thr Gly Asp His 

.610 615 
Lys Gly Val Gly lie lie Ser 

625 630 
Val Ala Ala Arg Leu Lys He 

640 645 
Ala Ala Lys Ala He Val Val 

655 660 
Gin Ser Lys Gin Leu Asp Gin 

670 675 
Val Phe Ala Arg Thr Ser Pro 

685 690 
Gly Cys Gin Arg Leu Gly Ala 

700 705 
Val Asn Asp Ser Pro Ala Leu 

715 720 
Met Gly lie Ser Gly Ser Asp 

730 735 
lie Leu Leu Asp Asp Asn Phe 

745 750 
Glu Gly Arg Leu lie Phe Asp 

760 765 
Thr Leu Thr Ser Asn lie Pro 

775 780 
He lie Leu Gly lie Pro Leu 

790 795 
Cys lie Asp Leu Gly Thr Asp 

805 810 
Tyr Glu Ser Ala Glu Ser Asp 

820 825 
Pro Lys Thr Asp Asn Leu Val 

835 840 
Tyr Gly Gin lie Gly Met He 

850 855 
Tyr Phe Val lie Leu Ala Glu 

865 870 
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Asn 




Phe 


Arg 


pro vai Asp 


Leu 


Leu 


oiy Ale niy JjcU MIS 


Trp 










P*7 ^ 
O / D 






GOV 






Asp 


Lys 


Tyr 


Leu Asn Asp 


Leu 


VjlU 


Ren Cor" TH/v f*! \/ f 1 t-i 

nop oeir lyZ: v?iy lain 


bin 


















on n 


Trp 


inr 


Tyr 




o-Ln Hry L»ys 


val 


Val 




1 nr 
















01 ft 

3 J.U 


"1 j 


Ala 


nun 

pne 


pne 


vai 


mr lie vai 


Val 


Val 


uin irp Ala /\sp uGU. 


lie 
















go c: 


y j u 


lie 


ser 




Thr 


Arg Arg Asn 


Ser 


Leu 


pne uin uin L»iy Met 


Arg 








935 






940 




Asn 


Lys 


Val 


Leu 


lie Pne Giy 


He 


Leu 


Glu Glu Thr Leu Leu 


Aia 








950 






955 


y ou 


Ala 


Phe 


Leu 


Ser 


Tyr Thr Pro 


Gly Met 


Asp Val Ala Leu Arg 


Met 










965 






970 


975 


Tyr 


Pro 


Leu 


Lys 


He Thr Trp 


Trp 


Leu 


Cys Ala He Pro Tyr 


Ser 










980 






985 


990 


He 


Leu 


He 


Phe 


Val Tyr Asp 


Glu 


He 


Arg Lys Leu Leu He 


Arg 










995 




1000 


1005 


Gin 


His 


Pro 


Asp 


Gly Trp Val 


Glu Arg Glu Thr Tyr Tyr 










1010 




1015 





<210> 8 
<211> 313 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 143935CD1 



<400> 8 


















Met 


Glu 


Glu 


Met 


Ser Gly 


Glu 


Ser 


Val 


Val Ser Ser Ala Val Pro 


1 








5 










10 15 


Ala 


Ala 


Ala 


Thr 


Arg Thr 


Thr 


Ser 


Phe 


Lys Gly Thr Ser Pro Ser 










20 










25 30 


Ser 


Lys 


Tyr 


Val 


Lys 
35 


Leu 


Asn 


Val 


Gly 


Gly Ala Leu Tyr Tyr Thr 
40 45 


Thr 


Met 


Gin 


Thr 


Leu 
50 


Thr 


Lys 


Gin 


Asp 


Thr Met Leu Lys Ala Met 
55 60 


Phe 


Ser 


Gly 


Arg 


Met 


Glu 


Val 


Leu 


Thr 


Asp Ser Glu Gly Trp He 








65 










70 75 


Leu 


He 


Asp 


Arg 


Cys Gly 


Lys 


His 


Phe 


Gly Thr He Leu Asn Tyr 








80 










85 90 


Leu 


Arg 


Asp 


Gly 


Ala Val 


Pro 


Leu 


Pro 


Glu Ser Arg Arg Glu He 






95 










100 105 


Glu 


Glu 


Leu 


Leu 


Ala 
110 


Glu 


Ala 


Lys 


Tyr 


Tyr Leu Val Gin Gly Leu 
115 120 


Val 


Glu 


Glu 


Cys 


Gin 


Ala 


Ala 


Leu 


Gin 


Asn Lys Asp Thr Tyr Glu 








125 










130 135 


Pro 


Phe 


Cys 


Lys 


Val 


Pro 


Val 


He 


Thr 


Ser Ser Lys Glu Glu Gin 








140 










145 150 


Lys 


Leu 


He 


Ala 


Thr 
155 


Ser 


Asn 


Lys 


Pro 


Ala Val Lys Leu Leu Tyr 
160 165 


Asn 


Arg 


Ser 


Asn 


Asn 
170 


Lys 


Tyr 


Ser 


Tyr 


Thr Ser Asn Ser Asp Asp 
175 180 


Asn 


Met 


Leu 


Lys 


Asn 


He 


Glu 


Leu 


Phe 


Asp Lys Leu Ser Leu Arg 








185 










190 195 


Phe 


Asn 


Gly 


Arg 


Val 


Leu 


Phe 


He 


Lys 


Asp Val He Gly Asp Glu 








200 










205 210 


He 


Cys 


Cys 


Trp 


Ser 


Phe 


Tyr 


Gly Gin 


Gly Arg Lys He Ala Glu 










215 










220 225 


Val 


Cys 


Cys 


Thr 


Ser 


He 


Val 


Tyr Ala 


Thr Glu Lys Lys Gin Thr 








230 










235 240 
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Lys Val Glu Phe Pro Glu Ala Arg lie Tyr Glu Glu Thr Leu Asn 

245 250 255 

He Leu Leu Tyr Glu Ala Gin Asp Gly Arg Gly Pro Asp Asn Ala 

260 265 270 

Leu Leu Glu Ala Thr Gly Gly Ala Ala Gly Arg Ser His His Leu 

275 280 285 

Asp Glu Asp Glu Glu Arg Glu Arg He Glu Arg Val Arg Arg He 

290 295 300 

His He Lys Arg Pro Asp Asp Arg Ala His Leu His Gin 

305 310 

<210> 9 
<211> 921 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5923789CD1 

<400> 9 

Met Ala Trp Leu Arg Leu Gin Pro Leu Thr Ser Ala Phe Leu His 
15 10 15 

Phe Gly Leu Val Thr Phe Val Leu Phe Leu Asn Gly Leu Arg Ala 
20 25 30 

Glu Ala Gly Gly Ser Gly Asp Val Pro Ser Thr Gly Gin Asn Asn 
35 40 45 

Glu Ser Cys Ser Gly Ser Ser Asp Cys Lys Glu Gly Val He Leu 
50 55 60 

Pro He Trp Tyr Pro Glu Asn Pro Ser Leu Gly Asp Lys He Ala 
65 70 75 

Arg Val He Val Tyr Phe Val Ala Leu He Tyr Met Phe Leu Gly 
80 85 90 

Val Ser He He Ala Asp Arg Phe Met Ala Ser He Glu Val He 
95 100 105 

Thr Ser Gin Glu Arg Glu Val Thr He Lys Lys Pro Asn Gly Glu 

110 115 120 

Thr Ser Thr Thr Thr He Arg Val Trp Asn Glu Thr Val Ser Asn 

125 130 135 

Leu Thr Leu Met Ala Leu Gly Ser Ser Ala Pro Glu He Leu Leu 

140 145 150 

Ser Leu He Glu Val Cys Gly His Gly Phe He Ala Gly Asp Leu 

155 160 165 

Gly Pro Ser Thr He Val Gly Ser Ala Ala Phe Asn Met Phe He 

170 175 180 

He He Gly He Cys Val Tyr Val He Pro Asp Gly Glu Thr Arg 

185 190 195 

Lys He Lys His Leu Arg Val Phe Phe He Thr Ala Ala Trp Ser 

200 205 210 

He Phe Ala Tyr He Trp Leu Tyr Met He Leu Ala Val Phe Ser 

215 220 225 

Pro Gly Val Val Gin Val Trp Glu Gly Leu Leu Thr Leu Phe Phe 

230 235 240 

Phe Pro Val Cys Val Leu Leu Ala Trp Val Ala Asp Lys Arg Leu 

245 250 255 

Leu Phe Tyr Lys Tyr Met His Lys Lys Tyr Arg Thr Asp Lys His 

260 265 270 

Arg Gly He He He Glu Thr Glu Gly Asp His Pro Lys Gly He 

275 280 285 

Glu Met Asp Gly Lys Met Met Asn Ser His Phe Leu Asp Gly Asn 

290 295 300 

Leu Val Pro Leu Glu Gly Lys Glu Val Asp Glu Ser Arg Arg Glu 

305 310 315 
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Met 


He 


Arg 


Asp 


Leu 


Asp 


His 


Gin 


Gin 


Met 


Met 


Thr 


Gin 


Ala 


Lys 


Pro 


Glu 


Asp 


Gin 


Cys 


Leu 


Lys 


Gly 


Gly 


Glu 


Asp 


Gly 


Gly 


Thr 


Val 


Val 


Gly 


He 


Phe 


Val 


Arg 


Glu 


Gly 


Met 


Ala 


Val 


Leu 


Asp 


Asp 


His 


Val 


Ser 


Glu 


Ser 


Gly 


Ala 


Gly 


Thr 


Ala 


Glu 


Leu 


Glu 


Lys 


He 


Val 


He 


Ala 


Leu 


Val 


Thr 


Asp 


He 


Ala 


Glu 


Glu 


Val 


He 


Lys 


Leu 


He 


Ser 


Trp 


Arg 


Gly 


Asp 


Glu 


Cys 


Phe 


Asp 


Leu 


Phe 


Ala 


Cys 


Phe 


Ala 


He 


Gly 


Asp 


Asp 


Ser 


Val 



He 


Leu 


Lys 




320 




Gin 


Leu 


Val 




1 o c 

3 Jb 




Lys 


Ser 


Arg 




350 




Gly 


Ala 


Gly 




365 




Lys 


Ala 


Ser 




380 




Phe 


He 


Ser 




395 




Glu 


Asn 


Cys 




410 




Asp 


Met 


Ser 




425 




Ser 


Ala 


Asn 




440 




Val 


Leu 


Lys 




455 




He 


Asp 


Asp 




470 




Leu 


Ser 


Asn 




485 




Pro 


Pro 


Ala 




500 




Ala 


Ser 


Pro 




515 




Ala 


Gly 


He 




530 




Ser 


He 


Gly 




545 




Arg 


Gly 


Thr 




560 




Lys 


Gly 


Gly 




575 




Phe 


Lys 


Asn 




590 




Asp 


Glu 


Glu 




605 




Gly 


Glu 


Pro 




620 




Arg 


Lys 


Leu 




635 




Met 


Gly 


Lys 




650 




He 


Glu 


Glu 




665 




Lys 


Lys 


Thr 




680 




Asp 


Gin 


Phe 




695 




Asp 


Glu 


Asp 




710 




Tyr 


Val 


Met 




725 




Cys 


Val 


Pro 




740 




Val 


Ser 


He 




755 




Leu 


Ala 


Ser 




770 




Thr 


Ala 


Val 



Asp 


Leu 


Lys 


Glu 


Met 


Ala 


Ala 


Phe 


Tyr 


Asn 


lie 


Leu 


Ser 


Met 


Ser 


Lys 


Val 


Phe 


Gly 


Ala 


Val 


Lys 


Thr 


Met 


Ala 


Gly 


Ala 


Pro 


Gly 


Glu 


Asp 


He 


Phe 


Val 


Arg 


He 


He 


Phe 


Asn 


Cys 


Val 


Ala 


Phe 


Thr 


Phe 


Val 


Met 


Glu 


Val 


He 


Val 


Gly 


Glu 


Asp 


Asp 


Glu 


Thr 


Glu 


Tyr 


Glu 


LVS 


Tro 


Met 


Thr 


Met 


Glu 


Pro 


Val 


Leu 


Ser 


Tyr 


Glu 


Asn 


Leu 


Ala 


Met 


Glu 


Ala 


Glu 


Ser 


Gly 


His 


Phe 


Leu 


Pro 


Thr 


Glu 


Leu 


lie 


lie 


His 


Phe 


Gly 


Val 


Phe 


Val 



Gin 


Lys 


His 


"J 1 c 

325 






Asn 


Tyr 


Tyr 


*a a n 
340 






Arg 


lie 


Gin 


355 






Lys 


Lys 


His 


370 






Glu 


vai 


His 


385 






Phe 


Asp 


Pro 


400 






Leu 


Leu 


Thr 


415 






Tyr 


Val 


Asp 


430 






Asp 


Tyr 


Glu 


445 






Thr 


Gin 


Lys 


460 






Glu 


Glu 


Asp 


475 






Glu 


Glu 


Glu 


490 






Ser 


Leu 


Pro 


505 






Thr 


Val 


Thr 


520 






Glu 


Cys 


Asp 


535 






Val 


Lys 


Val 


550 






Pro 


Phe 


Arg 


565 






Phe 


Glu 


Asp 


580 






Val 


Lys 


Thr 


595 






Arg 


Gin 


Glu 


610 






Glu 


Arg 


Gly 


625 






Glu 


Glu 


Glu 


640 






Gly 


Glu 


His 


655 






Phe 


Lys 


Thr 


670 






Leu 


Val 


Val 


685 






lie 


Thr 


Val 


700 






Glu 


Glu 


Arg 


715 






Thr 


val 


pne 


730 






Tyr 


Cys 


His 


745 






Gly 


Met 


Leu 


760 






Cys 


Thr 


He 


775 






Ala 


Phe 


Gly 



17/83 



Pro 


Glu 


Lys 






33U 


Aia 


Leu 


Ser 






T A C 

345 


7V "I _ 

Ala 


Thr 


Arg 








TV T a 

Ala 


Ala 


Glu 






375 


Thr 


Asp 


Glu 






390 


Cys 


Ser 


Tyr 






405 


Val 


Val 


Arg 






420 


Tyr 


Lys 


Thr 






435 


Phe 


Thr 


Glu 






450 


Glu 


Phe 


Ser 






465 


Glu 


His 


Phe 






480 


Gin 


Pro 


Glu 






495 


Leu 


Pro 


Arg 






510 


lie 


Leu 


Asp 






525 


Thr 


lie 


His 






540 


Leu 


Arg 


Thr 






555 


Thr 


Val 


Glu 






570 


Thr 


Tyr 


Gly 






585 


lie 


Arg 


Val 






600 


Asn 


Phe 


Phe 






615 


lie 


Ser 


Asp 






630 


Ala 


Lys 


Arg 






645 


Pro 


Lys 


Leu 






660 


Thr 


Val 


Asp 






675 


Gly 


Thr 


His 






690 


Ser 


Ala 


Ala 






705 


Leu 


Pro 


Ser 






720 


Trp 


Lys 


Val 






735 


Gly 


Trp 


Ala 






750 


Thr 


Ala 


lie 






765 


Gly 


Leu 


Lys 






780 


Thr 


Ser 


Val 



785 790 795 

Pro Asp Thr Phe Ala Ser Lys Ala Ala Ala Leu Gin Asp Val Tyr 

800 805 810 

Ala Asp Ala Ser He Gly Asn Val Thr Gly Ser Asn Ala Val Asn 

815 820 825 

Val Phe Leu Gly He Gly Leu Ala Trp Ser Val Ala Ala He Tyr 

830 835 840 

Trp Ala Leu Gin Gly Gin Glu Phe His Val Ser Ala Gly Thr Leu 

845 850 855 

Ala Phe Ser Val Thr Leu Phe Thr He Phe Ala Phe Val Cys He 

860 865 870 

Ser Val Leu Leu Tyr Arg Arg Arg Pro His Leu Gly Gly Glu Leu 

875 880 885 

Gly Gly Pro Arg Gly Cys Lys Leu Ala Thr Thr Trp Leu Phe Val 

890 895 900 

Ser Leu Trp Leu Leu Tyr He Leu Phe Ala Thr Leu Glu Ala Tyr 

905 910 915 

Cys Tyr He Lys Gly Phe 

920 

<210> 10 
<211> 1466 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 6046484CD1 

<400> 10 

Met Tyr He Arg Val Ser Tyr Asp Thr Lys Pro Asp Ser Leu Leu 
15 10 15 

His Leu Met Val Lys Asp Trp Gin Leu Glu Leu Pro Lys Leu Leu 
20 25 30 

lie Ser Val His Gly Gly Leu Gin Asn Phe Glu Met Gin Pro Lys 
35 40 45 

Leu Lys Gin Val Phe Gly Lys Gly Leu He Lys Ala Ala Met Thr 
50 55 60 

Thr Gly Ala Trp lie Phe Thr Gly Gly Val Ser Thr Gly Val He 
65 70 75 

Ser His Val Gly Asp Ala Leu Lys Asp His Ser Ser Lys Ser Arg 
80 85 90 

Gly Arg Val Cys Ala lie Gly He Ala Pro Trp Gly He Val Glu 
95 100 105 

Asn Lys Glu Asp Leu Val Gly Lys Asp Val Thr Arg Val Tyr Gin 
110 115 120 

Thr Met Ser Asn Pro Leu Ser Lys Leu Ser Val Leu Asn Asn Ser 
125 130 135 

His Thr His Phe lie Leu Ala Asp Asn Gly Thr Leu Gly Lys Tyr 
140 145 150 

Gly Ala Glu Val Lys Leu Arg Arg Gin Leu Glu Lys His lie Ser 
155 160 165 

Leu Gin Lys lie Asn Thr Arg lie Gly Gin Gly Val Pro Val Val 
170 175 180 

Ala Leu lie Val Glu Gly Gly Pro Asn Val lie Ser He Val Leu 
185 190 195 

Glu Tyr Leu Arg Asp Thr Pro Pro Val Pro Val Val Val Cys Asp 
200 205 210 

Gly Ser Gly Arg Ala Ser Asp lie Leu Ala Phe Gly His Lys Tyr 
215 220 225 

Ser Glu Glu Gly Gly Leu lie Asn Glu Ser Leu Arg Asp Gin Leu 
230 235 240 

Leu Val Thr lie Gin Lys Thr Phe Thr Tyr Thr Arg Thr Gin Ala 
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Gin His Leu Phe 
Leu lie Thr Val 
Asp Leu Ala lie 
Ala Pro Asp Gin 
He Ala Arg Ser 
Gly Ser Leu Glu 
Val Asp Phe Val 
Arg Phe Leu Thr 
His Gly Pro Ser 
Lys Gly Asn Leu 
Gly Leu Val He 
Tyr Thr Arg Lys 
Pro Lys Arg Asp 
Lys Lys Arg Glu 
He Asn His Phe 
Val Leu Met Lys 
Gly Glu Glu Ala 
Lys Ala Met Ala 
He Ser Gin Glu 
Ala Val Glu Leu 
Ala Met Lys Leu 
Thr Cys Leu Gin 
Ala His Thr Cys 
Arg Leu Arg Met 
He Leu Leu Pro 
Asp Asp Phe Ser 
Glu Lys Glu Glu 
Arg Lys Gly Asp 
Pro He Gly Thr 
Lys Phe Trp Phe 
Phe Asn Tyr Val 



245 250 255 

He He Leu Met Glu Cys Met Lys Lys Lys Glu 
260 265 270 

Phe Arg Met Gly Ser Glu Gly His Gin Asp He 
275 280 285 

Leu Thr Ala Leu Leu Lys Gly Ala Asn Ala Ser 
290 295 300 

Leu Ser Leu Ala Leu Ala Trp Asn Arg Val Asp 
305 310 315 

Gin He Phe He Tyr Gly Gin Gin Trp Pro Val 
320 325 330 

Gin Ala Met Leu Asp Ala Leu Val Leu Asp Arg 
335 340 345 

Lys Leu Leu He Glu Asn Gly Val Ser Met His 
350 355 360 

He Ser Arg Leu Glu Glu Leu Tyr Asn Thr Arg 
365 370 375 

Asn Thr Leu Tyr His Leu Val Arg Asp Val Lys 
380 385 390 

Pro Pro Asp Tyr Arg He Ser Leu He Asp He 
395 400 405 

Glu Tyr Leu Met Gly Gly Ala Tyr Arg Cys Asn 
410 415 420 

Arg Phe Arg Thr Leu Tyr His Asn Leu Phe Gly 
425 430 435 

Asp He Pro Leu Arg Arg Gly Arg Lys Thr Thr 
440 445 450 

Glu Glu Val Asp He Asp Leu Asp Asp Pro Glu 
455 460 465 

Pro Phe Pro Phe His Glu Leu Met Val Trp Ala 
470 475 480 

Arg Gin Lys Met Ala Leu Phe Phe Trp Gin His 
485 490 495 

Met Ala Lys Ala Leu Val Ala Cys Lys Leu Cys 
500 505 510 

His Glu Ala Ser Glu Asn Asp Met Val Asp Asp 
515 520 525 

Leu Asn His Asn Ser Arg Asp Phe Gly Gin Leu 
530 535 540 

Leu Asp Gin Ser Tyr Lys Gin Asp Glu Gin Leu 
545 550 555 

Leu Thr Tyr Glu Leu Lys Asn Trp Ser Asn Ala 
560 565 570 

Leu Ala Val Ala Ala Lys His Arg Asp Phe He 
575 580 585 

Ser Gin Met Leu Leu Thr Asp Met Trp Met Gly 
590 595 600 

Arg Lys Asn Ser Gly Leu Lys Val He Leu Gly 
605 610 615 

Pro Thr He Leu Phe Leu Glu Phe Arg Thr Tyr 
620 625 630 

Tyr Gin Thr Ser Lys Glu Asn Glu Asp Gly Lys 
635 640 645 

Glu Asn Thr Asp Ala Asn Ala Asp Ala Gly Ser 
650 655 660 

Glu Glu Asn Glu His Lys Lys Gin Arg Ser He 
665 670 675 

Lys He Cys Glu Phe Tyr Asn Ala Pro He Val 
680 685 690 

Phe Gin He Ser Tyr Leu Gly Tyr Leu Leu Leu 
695 700 705 

He Leu Val Arg Met Asp Gly Trp Pro Ser Leu 
710 715 720 
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T 1 o \7a 1 
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77fi 


Leu 


bin 


Asn 


Gin Pro 








/ O J 


va i 


ASp 


Tl rr» 
lie 


lie trl lc 








oUU 


\Ta 1 
val 


Asn 


Lys 


lyr jjeu 








Q1 C 


vaJi 


Ser 


Ser 


Gly lie 








OTA 

830 


Mat- 
Met 


oer 


pne 


Giy vai 








845 


Lys 


Pro 


Ser 


Trp Lys 








obO 


Trp 


Met 


lie 


Tyr Gly 








O *7 C 

875 


Ser 


Pne 


pne 


Leu Ser 








oyu 


GlU 


Giy 


Lys 


Arg Leu 








one 
y 05 


Fro 


Ala 

Aia 


Leu 


Met Aia 










Val 


Asn 


Leu 


Leu Xle 










Lys 


Ser 


lie 


Ser Asn 








OCA 

950 


lie 


Met 


Tnr 


Phe His 








965 


lie 


Leu 


Ser 


His lie 








980 


Cys 


Arg 


Lys 


Lys Arg 








995 


Leu 


Ser 


Met 


Phe Leu 








1010 


Pne 


Glu 


Glu 


Gin Cys 








lUZD 


GlU 


Gin 


Gin 


Ser Ser 








1040 


Arg 


val 


GlU 


Asn Met 








1 A C C 

lUbb 


Glu 


Thr 


pne 


Met Lys 








1 A"7 A 
1U / U 


Ala 


Gin 


Leu 


GlU GlU 








1 A D C 
11) OD 


Asn 


Leu 


Ala 


r*l %r Tin 

vjiy lie 








I 1 AA 

II uu 


Arg 


Aia 


ber 


Ser Glu 








i i i c 

111b 


Ser 


lie 


Asn 


ber Aia 








1130 


Asn 


Gly 


Glu 


Glu Leu 








1145 


Pro 


Gly 


Thr 


Gly Val 








1160 


Glu 


Glu 


Lys 


Asp Val 








1175 


Ser 


Leu 


His 


Leu Ser 



lie 


Ser 


r P\7T" 


lie v ax 








f 0 u 


Ala 


Thr 


ri vj 


lwyb nla 








74 c. 


Phe 


vjiy 


*r X lc 


v9in OlU 








7fin 
/ o u 


Cpr 


l 111 




Mot- Tlo 

nee lie 








77 C 


Tyr 


Mot- 

net. 


r*1 vr 

ijiy 


ryr Giy 








7on 
/ y u 


Trp 


Tyr 


Tin 

lie 


Arg vai 








O U5 


Giy 


pro 


Tyr 


vai Met 








QO A 


Leu 


Trp 


i» Q 1 

vai 


vai ne 








B3b 


Ala 


TV m 

Arg 


Gin 


Ala lie 








OCA 

850 


Leu 


Aia 


Arg 


Asn lie 








865 


Glu 


Val 


pne 


Ala Asp 








880 


Aia 


pro 


Cys 


p i . _ pi . , 
Giy GlU 








by b 


Pro 


Pro 


Cys 


lie Pro 








Q 1 A 

y 10 


Cys 


Tyr 


Leu 


Leu vai 








y 2b 


JVl 9 

Aia 


vai 


pne 


Ser Asn 








Q >1 A 

y4u 


r»l v\ 

Gin 


vai 


Trp 


Lys Phe 








n c c 

955 


Asp 


Arg 


Pro 


Val Leu 








AIA 

970 


Tyr 


Tl *■» 

lie 


Ti- 
ne 


lie Met 








985 


Glu 


Gly 


Asp 


Gin Glu 








1000 


Ser 


Asp 


Glu 


Glu Leu 








1 A*l C 

1015 


val 


Gin 


Glu 


His Phe 








1 A*J A 


Ser 


Asp 


p 1 11 

GlU 


TV Tl « 

Arg ne 








1 A » C 

1045 


Ser 


Met 


Arg 


Leu Glu 








i a c a 
1 UbU 


Thr 


Ser 


Leu 


bin inr 








1 A*"7 C 

1U / b 


Leu 


Ser 


Asn 


Arg Met 








1 AAA 

iuyu 


Asp 


Arg 


Ser 


7\ p» y*\ T At 1 

>\sp lieu 








11 AC 

llUb 


Cys 


Glu 


Ala 


Thr Tyr 








1 1 1 A 

1120 


Asp 


oiy 


Tyr 


Ser Leu 








1135 


Leu 


Phe 


Glu 


Asp Thr 








1150 


Arg 


Lys 


Lys 


Thr Cys 








1165 


Lys 


Thr 


His 


Leu Val 








1180 


Leu 


Gly 


Thr 


Ser Thr 



Car 
OCX 


Leu 


Ala 


Leu Glu 








-J-lC 

/ J J 


Cor 

OCl 




Car 
OCX. 


Pro Leu 








7 ^ n 
/ bu 


Tyr 


Trp 


Asn 


iie i nr 








/ob 


Giy 


Aia 


i ie 


Leu Arg 








7 Q A 

/ o U 


Arg 


vai 


Tl Q 

i ie 


Tyr Cys 








/ y b 


Leu 


Asp 


Tl Q 

lie 


riU A pi,. 

pne Giy 








Q 1 A 

olO 


Met 


Tl a 

ne 


pi,. 
Giy 


.Lys Met 








Q O C 

825 


Met 


Leu 


Ua 1 

vai 


Val Leu 








840 


Leu 


His 


Pro 


GlU GlU 








855 


Phe 


Tyr 


Met 


Pro Tyr 








870 


Gin 


lie 


Asp 


Arg Lys 








885 


Asn 


Leu 


Tyr 


Asp Glu 








Q A A 

900 


Gly 


Ala 


Trp 


Leu Thr 








QIC 

y 15 


7V 1 n 

Aia 


Asn 


Tl a 

lie 


Leu Leu 








A Tl A 

y 30 


Thr 


pne 


pne 


giu vai 








A VI C 


Gin 


Arg 


Tyr 


Gin Leu 








A C A 

960 


Pro 


Pro 


Pro 


W i_ Tin 

Met lie 








975 


Arg 


Leu 


Ser 


Gly Arg 








990 


Glu 


Arg 


Asp 


Arg Gly 








*1 AAC 

1005 


Lys 


Arg 


Leu 


His Glu 








1020 


Arg 


Glu 


Lys 


Glu Asp 








1 Ale 

10 35 


Arg 


Val 


Thr 


Ser Glu 








1 AC A 

lobo 


GlU 


lie 


Asn 


Glu Arg 








1 AC C 

lUob 


vai 


Asp 


Leu 


Arg Leu 








1 no a 


vai 


Asn 


7A 1 a 

Ala 


Leu Glu 








iuy b 


Tl o 

ne 


p i _ 
Gin 


Ala 

Aia 


Arg Ser 








*1 1 1 A 
1110 


Leu 


Leu 


Arg 


Gin faer 








1 1 1 c 




ai, y 




nib Irllc 








1140 


Ser 


Leu 


Ser 


Thr Ser 








1155 


Ser 


Phe 


Arg 


lie Lys 








1170 


Pro 


Glu 


Cys 


Gin Asn 








1185 


Ser 


Ala 


Thr 


Pro Asp 
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1 190 


1195 


1200 


Glv 
^1Y 


Ser 


His 


Leu Ala Val Asd 


Asp Leu Lys Asn Ala Glu 


Glu Ser 






1205 


1210 


1215 


Lys 


Leu 


Glv 


Pro Asp He Gly 


He Ser Lys Glu Asp Asp 


Glu Arg 








1220 


1225 


1230 


Gin 


Thr 


Asp 


Ser Lvs Lvs Glu 


Glu Thr He Ser Pro Ser 


Leu Asn 








1235 


1240 


1245 


Lys 


Thr 


Asp 


Val He His Gly 


Gin Asp Lys Ser Asp Val 


Gin Asn 








1250 


1255 


1260 


Thr 

X UX 


Gin 


JJCU 


Thr Val Glu Thr 

1111. VOX UXU 11 li 


Thr Asn He Glu Glv Thr 

l ill noli J. J± W \J ^ wt wAJ ^ 


He Ser 








1 ?fi5 


1270 


1275 


Tyr 




T .01 1 


<JlU blU 1111 Jjyo 


T1p Thr Am Tvr Phe Pro 

lie i Hi nl^ i jX flic 


Asn Glu 








1 9flft 
1ZOU 


1285 


1290 


i nr 


Tin 

lie 


Asn 


nia ^ys iiys 1 in 


Mot* T .vb Cor Irn Cor Pho 

l it; Lr Ujr D OCX m. y OCX fllC 


Val Tyr 










1 300 


1305 


Ser 


Arg 


oxy 


Arg iiys Lieu vai 


f*1 vr <"2l \r \7a 1 & can Gin AQn 
Vjiy x y vox riaii uiu risky 


Val Glu 








inn 

lJlU 


iji j 


1320 


Tyr 


oer 


Ser 


lie xnr asp uin 


Gin TiPii Thr Thr Glu Tro 

OXll UCU 1 111 1111 OIU iif 


Gin Cvs 








ij/j 




1335 


Ol n 


vol 


bin 


ijys lie iiix7 r\L y 


dor Hi r Cor Thr Asr> Tie 

OCX XllD OCX 1 11X AO^/ 11C 


Pro Tyr 








1 ^An 


1345 


1350 


lie 


Val 


Coy 


Pin Zila 211a \7a 1 
wlu nlo nia Val 


Gin Ala Glu His Lvs Glu 

V7X11 nld uiu nxo uy 0 uiu 


Gin Phe 








IjDj 




1365 


TV "t a 
rild 


Asp 


Met 


bill nop U1U nib 


His Val Ala Glu Ala He 


Pro Arg 






1 no 

1j / U 




1380 


lie 


Pro 


Arg 


Lieu oci Lieu i nr 


Tl a Thr A on Arrr Acn Glv 
lie llll nop ftiy noil vj.l y 


Met Glu 

liCl> VJ J- LA 






i *}ft5 


1390 


1395 


Asn 


L6U 


Leu 


Coy* \7a 1 T.vrn Prn 


Asn Gin Thr Leu Glv Phe 


Pro Ser 








1400 


1405 


1410 


Leu 


Arg 


Ser 


Lys Ser Leu His 


Gly His Pro Arg Asn Val 


Lys Ser 






1415 


1420 


1425 


He 


Gin 


Gly 


Lys Leu Asp Arg 


Ser Gly His Ala Ser Ser 


Val Ser 






1430 


1435 


1440 


Ser 


Leu 


Val 


He Val Ser Gly 


Met Thr Ala Glu Glu Lys 


Lys Val 








1445 


1450 


1455 


Lys 


Lys 


Glu 


Lys Ala Ser Thr 


Glu Thr Glu Cys 










1460 


1465 





<210> 11 
<211> 222 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7481427CD1 

<400> 11 



Met 


Thr 


Glu 


Gin 


Ala He 


Ser 


Phe 


Ala 


Lys Asp Phe Leu 


Ala 


Gly 


1 








5 








10 




15 


Gly 


He 


Thr 


Ala 


Ala He 


Ser 


Lys 


Thr 


Ala Val Ala Ser 


He 


Lys 








20 








25 




30 


Arg 


Val 


Gin 


Leu 


Leu Leu 


Gin 


Met 


Gin 


His Ala Ser Met 


Pro 


Met 








35 








40 




45 


Ala 


Ala 


Ala 


Lys 


Gin Cys 


Lys 


Gly 


He 


Val Asp Cys He 


Val 


Arg 








50 








55 




60 


He 


Pro 


Lys 


Asp 


Gin Gly 


Val 


Leu 


Ser 


Phe Trp Arg Gly Asn 


Leu 








65 








70 




75 


Ala 


Asn 


Val 


He 


Arg Tyr 


Ser 


Pro 


Thr 


Gin Ala Leu Asn 


Phe 


Ala 










80 








85 




90 


Phe 


Lys 


Asp 


Lys 


Tyr Lys 


Gin 


He 


Phe 


Leu Ala Gly Val 


Asp 


Lys 










95 








100 




105 


His 


Thr 


Gin 


Phe 


Cys Arg 


Tyr 


Phe 


Ala 


Gly Asn Leu Ala 


Ser 


Gly 
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1 JLU 




Ala \; a 1 
r\± d Veil 


vai iyr 






Jl ZD 


nXa Ala 


Asp va± 


Gly Lys 








Leu Gly 


Asp Cys 


Leu Val 






1 c c 


Gly Leu 


Tyr Gin 


Gly Phe 






170 


Tyr Gin 


Ala Ala 


Tyr Phe 






185 


Phe Pro 


Asp Pro 


Lys Asn 






200 


Ala Gin 


Thr Val 


Thr Ala 






215 



Pro Leu 
Ser Gly 
Lys lie 
Ser Val 
Arg Val 
Thr His 
Val Ala 



115 
Asp Phe 

130 
Thr Glu 

145 
Ser Lys 

160 
Ser Val 

175 
Tyr Asp 

190 
lie Leu 

205 
Gly Val 

220 



Thr Arg 
Arg Glu 
Ser Asp 
Gin Ala 
Thr Ala 
Val Ser 
Leu Ser 



120 

Thr Arg Leu 
135 

Phe Arg Gly 
150 

Gly lie Arg 
165 

lie He He 
180 

Asn Gly Met 
195 

Trp Met Thr 
210 



<210> 12 
<211> 461 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7483595CD1 



<400> 12 



Met 


Glu 


Thr 


Leu 


Leu 


Phe 


Leu 


Glu Ser Ala He Gly Ala He Ala 


1 








5 










10 15 


vjiy 


Leu 


Lys 


inr 


Phe 


Tyr 


Pro 


Thr 


Leu 


Val Phe Ser Ser Leu Gin 






20 










25 30 


Leu 


He 


Met 


Gly 


Val 


Leu 


Gly 


Leu Gly Phe He Ala Thr Tyr Leu 










35 










40 45 


Pro 


Glu 


Ser 


Ala 


Met 


Ser 


Ala 


Tyr Leu Ala Ala Val Ala Leu His 










50 










55 60 


He 


Met 


Leu 


Ser 


Gin 


Leu 


Thr 


Phe He Phe Gly He Met He Ser 










65 










70 75 


Phe 


His 


Ala 


Gly 


Pro 


He 


Ser 


Phe 


Phe 


Tyr Asp He He Asn Tyr 








80 










85 90 


Cys 


Val 


Ala 


Leu 


Pro 


Lys 


Ala 


Asn 


Ser 


Thr Ser He Leu Val Phe 








95 










100 105 


Leu 


Thr 


Val 


Val 


Val 


Ala 


Leu 


Arg 


He 


Asn Lys Cys He Arg He 










110 










115 120 


Ser 


Phe 


Asn 


Gin 


Tyr 


Pro 


He 


Glu 


Phe 


Pro Met Glu Leu Phe Leu 










125 










130 135 


He 


He 


Leu 


Gin 


Ala 


Phe 


Ser 


Leu 


Ser 


Leu Val Ser Ser Phe Leu 










140 










145 150 


Leu 


He 


Phe 


Leu 


Gly Lys 


Lys 


He 


Ala 


Ser Leu His Asn Tyr Ser 










155 










160 165 


Val 


Asn 


Ser 


Asn 


Gin 


Asp 


Leu 


He 


Ala 


He Gly Leu Cys Asn Val 










170 










175 180 


Val 


Ser 


Ser 


Phe 


Phe Arg 


Ser 


Cys Val 


Phe Thr Gly Ala He Ala 










185 










190 195 


Arg 


Thr 


He 


He 


Gin Asp 


Lys 


Ser Gly Gly Ser Thr Thr Val Cys 










200 










205 210 


He 


Ser 


Gly 


Arg 


Arg Arg 


Ala 


Lys 


He Leu Leu Leu Gly Gin He 










215 










220 225 


Pro 


Asn 


Thr 


Asn 


He Tyr 


Arg 


Ser He Asn Asp Tyr Arg Glu He 










230 










235 240 


He 


Thr 


He 


Pro 


Gly Val 


Lys 


He Phe Gin Cys Cys Ser Ser He 










245 










250 255 


Thr 


Phe 


Val 


Asn 


Val 


Tyr 


Tyr 


Leu 


Lys 


His Lys Leu Leu Lys Glu 










260 










265 270 


Val 


Asp 


Met 


Val 


Lys Val 


Pro 


Leu 


Lys 


Glu Glu Glu He Phe Ser 
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275 




O O A 

280 






O Q *i 


Leu 


fne 


Asn 


Ser 


Ser Asp 


mr Asn Lieu Gin 


Gly Gly Lys 


He 


Cys 










290 










inn 


Arg 


Cys 


Phe Cys 


Asn Cys 


ASp ASp Lieu KdxXx 


Pro Leu Pro Arg 


11c 










T AC 




J-LU 






11 R 
Jlj 


L6U 


Tyr 


Thr 


Glu 


Arg 


Pne 


^ "1 i i A T \/n T ^i i 

(jiu Asn Liys Lieu 


Asp Pro Glu 


Ala 


Ser 










"5 O A 

320 










330 


Cor 


vai 


Asn 


Leu 


lie 


His 


Lys bei nlS fne 


Glu Ser Met 


Asn 


Thr 










335 




•j x n 






345 


ser 


bin 


Thr Ala 


Ser 


Glu 


Asp Gin vai pro 


Tyr Thr Val 


Ser 


Ser 










350 










360 


vai 


Ser 


Gin Lys 


Asn 


Gin 


Gly Gin Gin Tyr 


Glu Glu Val 


Glu 


Glu 










365 




370 






375 


Val 


Tip 


Leu 


Pro 


Asn 


Asn 


Ser Ser Arg Asn 


Ser Ser Pro Gly Leu 










380 




385 






390 


Pro 


Asp 


Val 


Ala 


Glu 


Ser 


Gin Gly Arg Arg 


Ser Leu lie 


Pro Tyr 










395 




A A A 

400 






405 


Ser 


Asp 


Ala 


Ser 


Leu 


Leu 


Pro Ser Val His 


Thr He He 


Leu 


Asp 










410 




415 






420 


Phe 


Ser 


Met 


Val 


His 


Tyr 


Val Asp Ser Arg 


Gly Leu Val 


Val 


Leu 










425 




430 






435 


Arg 


Gin 


Val 


Ser 


Thr 


Glu 


Glu Ala Leu Ala 


Gly Ala Leu 


He 


Pro 










440 




445 






450 


Leu 


Leu 


Pro 


Ser 


Gin 


Pro 


His Pro Asp Pro 


Asp 














455 




460 









<210> 13 
<211> 502 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 3788427CD1 

<400> 13 



Met 


Ser 


He 


Val 


Arg Leu 


Ser 


Val 


His 


Ala 


Lys Trp He Met Gly 


1 








5 








10 


15 


Lys 


Val 


Thr 


Gly 


Thr Lys 


Met 


Gin 


Lys 


Thr 


Ala Lys Val Arg Val 








20 








25 


30 


He 


Arg 


Leu 


Val 


Leu Asp 


Pro 


His 


Leu 


Leu 


Lys Tyr Tyr Asn Lys 










35 








40 


45 


Gin 


Lys 


Thr 


Tyr 


Phe Ala 


His 


Asn 


Ala 


Leu 


Gin Gin Cys Thr He 








50 








55 


60 


Gly 


Asp 


He 


Val 


Leu Leu 


Lys 


Ala 


Leu 


Pro 


Val Pro Arg Thr Lys 






65 








70 


75 


His 


Val 


Lys 


His 


Glu Leu 


Ala 


Glu 


He 


Val 


Phe Lys Val Gly Lys 










80 








85 


90 


Leu 


Val 


Asp 


Pro 


Val Thr 


Gly 


Lys 


Pro 


Arg 


Ala Gly Thr Thr Tyr 










95 








100 


105 


Leu 


Glu 


Ser 


Pro 


Leu Ser 


Ser 


Glu 


Thr 


Thr 


Gin Gly Val Asp Gly 










110 








115 


120 


Ala 


Ser 


Arg 


Pro 


Ser Arg 


Gly 


Pro 


Ala 


Pro 


Cys Arg Ala Gly Pro 










125 








130 


135 


Gly 


Ala 


Arg 


Arg 


Leu Arg 


Pro 


Trp 


Pro 


Glu 


Ser Pro Arg Pro Glu 










140 








145 


150 


Pro 


Arg 


Gly 


Leu 


Pro Gly 


Pro 


Gly Arg Gly 


Ser Met Ala Thr Trp 










155 








160 


165 


Arg 


Arg 


Asp 


Gly 


Arg Leu 


Thr 


Gly Gly Gin 


Arg Leu Leu Cys Ala 










170 








175 


180 


Gly 


Leu 


Ala 


Gly 


Thr Leu 


Ser 


Leu 


Ser 


Leu 


Thr Ala Pro Leu Glu 










185 








190 


195 


Leu 


Ala 


Thr 


Val 


Leu Ala 


Gin 


Val Gly Val 


Val Arg Gly His Ala 
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200 






205 






210 


Arg 




riu lip 


Ala 


Thr 


oiy 


nis Arg vai rrp 


Arg Ala Glu Gly 








215 






zzv 






zzo 


Leu 


nig 


ftl a T .Qi i 
nld uc U 


Trp 


Lys 




Asn nia vax AJ.Q 


Cys Leu Arg 


Leu 








230 






Z J D 






O A A 


rne 


Drn 

tro 


Cys Ser 


Ala 


Val 


Pi Y1 

urn 


L<eu Aia Aia Tyr 


Arg Lys 


pne 


vai 








245 












OCR 
ZOD 


vai 


Leu 


rile inr 


Asp Asp 


Leu 


biy his lie ber 


Gin Trp 


Ser 


Ser 








260 






2ob 






OTA 
2/0 


lie 


Met 


Ala Gly 


Ser 


Leu 


Ala 


Gly Met Val Ser 


Thr He 


Val 


Thr 








275 






O O A 

280 






285 


Tyr 


Pro 


Thr Asp 


Leu 


He 


Lys 


Thr Arg Leu He 


Met Gin 


Asn 


He 








290 






295 






300 


Leu 


Glu 


Pro Ser 


Tyr Arg 


Gly 


Leu Leu His Ala 


Phe Ser 


Thr 


He 








305 






310 






315 


Tyr 


Gin 


Gin Glu 


Gly Phe 


Leu 


Ala Leu Tyr Arg 


Gly Val 


Ser 


Leu 








320 






325 






330 


Thr 


Val 


Val Gly 


Ala 


Leu 


Pro 


Phe Ser Ala Gly 


Ser Leu 


Leu 


Val 








335 






340 






345 


Tyr 


Met 


Asn Leu 


Glu 


Lys 


He 


Trp Asn Gly Pro 


Arg Asp 


Gin 


Phe 








350 






355 






360 


Ser 


Leu 


Pro Gin 


Asn 


Phe 


Ala 


Asn Val Cys Leu 


Ala Ala 


Ala 


Val 








365 






370 






375 


Thr 


Gin 


Thr Leu 


Ser 


Phe 


Pro 


Pne Glu Tnr Val 


Lys Arg 


Lys 


Met 








380 






o o c 

385 






390 


Gin 


Ala 


Gin Ser 


Pro 


Tyr 


Leu 


Pro His Ser Gly 


Gly Val 


Asp 


Val 








395 






400 






405 


His 


Phe 


Ser Gly 


Ala 


Val 


Asp 


Cys Phe Arg Gin 


He Val 


Lys 


Ala 








410 






415 






420 


Gin 


Gly 


Val Leu 


Gly Leu 


Trp 


Asn Gly Leu Thr 


Ala Asn 


Leu 


Leu 








425 






A "> A 






435 


Lys 


lie 


Val Pro 


Tyr 


Phe 


Gly 


Tin vr n t> nVa Cov 

lie Met pne oer 


Thr Phe 


Glu 


Phe 








440 






445 






450 


Cys 


Lys 


Arg lie 


Cys 


Leu 


Tyr 


Gin Asn Gly Tyr 


He Leu 


Ser 


Pro 








455 






460 






465 


Leu 


Ser 


Tyr Lys 


Leu 


Thr 


Pro 


Gly Val Asp Gin 


Ser Leu 


Gin 


Pro 






470 






475 






480 


Gin 


Glu 


Leu Arg 


Glu 


Leu 


Lys 


Lys Phe Phe Lys 


Thr Arg Lys 


Leu 








485 






490 






495 


Lys 


Pro 


Lys Lys 


Pro 


Thr 


Leu 
















500 















<210> 14 
<211> 261 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 6972455CD1 

<400> 14 

Met Thr Lys Arg Tyr Ser Ala Leu Leu Thr Ala Leu Phe Ala Ser 
15 10 15 

Leu Met Leu Ser Gin Ala Pro Ala Gin Ala Ser Gly Leu Asp Asp 

20 25 30 

He Val Ala Arg Gly Thr Leu Lys Val Ala Val Pro Gin Asp Phe 

35 40 45 

Pro Pro Phe Gly Ser Val Gly Pro Asp Met Gin Pro Arg Gly Leu 

50 55 60 

Asp He Asp Thr Ala Lys Leu Leu Ala Asp Gin Leu Lys Val Lys 

65 70 75 

Leu Glu Leu Thr Pro Val Asn Ser Thr Asn Arg He Pro Phe Leu 
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80 85 90 

Thr Thr Gly Lys Val Asp Leu Val lie Ser Ser Leu Gly Lys Asn 
95 100 105 

Pro Glu Arg Ala Lys Val lie Asp Phe Ser Asn Ala Tyr Ala Pro 

110 115 120 

Phe Tyr Leu Ala Val Phe Gly Pro Pro Asp Ala Ala He Ala Ser 

125 130 135 

Leu Asp Asp Leu Lys Gly Lys Thr He Ser Val Thr Arg Gly Ala 

140 145 150 

He Glu Asp He Glu Leu Thr Ala Val Ala Pro Lys Glu Ala Thr 

155 160 165 

He Lys Arg Phe Glu Asp Asn Asn Ser Thr He Ala Ala Tyr Leu 

170 175 180 

Ala Gly Gin Val Asp Leu lie Ala Ser Gly Asn Val Val Met Val 

185 190 195 

Ala He Ser Glu Arg Asn Pro Lys Arg Val Pro Ala Leu Lys Val 

200 205 210 

Lys Leu Lys Asp Ser Pro Val Tyr Val Gly Val Asn Lys Asn Glu 

215 220 225 

Pro Ala Leu Leu Glu Lys Val Asn Gin He Leu Val Ala Ala Lys 

230 235 240 

Ala Asp Gly Ser Leu Gly Lys Asn Ala Met Gin Trp Leu Lys Glu 

245 250 255 

Pro Leu Pro Ala Asp Leu 

260 

<210> 15 
<211> 570 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 8077668CD1 

<400> 15 

Met Tyr Asp Pro Pro Lys Val Asn Lys Lys Asn Asp Ser Ala Asp 
1 5 10 15 

His Ser Asp Glu Ser Val Glu He Gin Ser Ala Leu His He Cys 
20 25 30 

Gly Phe Asp Gin Gly Arg Trp lie Leu Ser Tyr Arg Arg Ser Pro 
35 40 45 

Ala Arg Arg His Thr Gly Leu Ala Pro Val Leu Gly Val Arg Val 
50 55 60 

Arg Val Cys Val Gly Cys Gin Gly Ala Arg Val Pro Ala Trp Pro 
65 70 75 

Lys Arg Glu Gly Arg Gly Ser Ala Glu Glu Pro Val Glu Arg Lys 
80 85 90 

Asp Gly Gin Gin Gly Glu Arg Arg Gly Glu Ala Ser Leu Thr Lys 
95 100 105 

Asp Ala Val Thr Phe Gin Ser Ser Leu Pro Val Gin lie Thr Arg 
110 115 120 

He Pro Ala Cys Trp Arg Met Thr Gin Met Ser Gin Val Gin Glu 
125 130 135 

Leu Phe His Glu Ala Ala Gin Gin Asp Ala Leu Ala Gin Pro Gin 
140 145 150 

Pro Trp Trp Lys Thr Gin Leu Phe Met Trp Glu Pro Val Leu Phe 
155 160 165 

Gly Thr Trp Asp Gly Val Phe Thr Ser Cys Met lie Asn He Phe 
170 175 180 

Gly Val Val Leu Phe Leu Arg Thr Gly Trp Leu Val Gly Asn Thr 
185 190 195 

Gly Val Leu Leu Gly Met Phe Leu Val Ser Phe Val lie Leu Val 
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<210> 16 

<211> 1033 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 55120485CD1 

<400> 16 

Met Glu Glu Asn Ser Lys Lys Asp His Arg Ala Leu Leu Asn Gin 
15 10 15 
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<210> 17 
<211> 496 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_£eature 

<223> Incyte ID No: 3112883CD1 
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Ser Gly 


Lys 


Leu Asn 


Lys 


Gly 


Gin 


Leu 


Asn 


Ser 


Leu 


Lys Pro Ser 






245 










250 






255 


Asn Val 


Thr 


Val Asp 


Val 


Phe 


Val 


Gin Trp Phe Gin Asp Leu Lys 






260 










265 






270 


Glu Cys 


Tyr 


Ser Ser 


Lys 


Arg 


Leu 


Phe Tyr Trp Ser 


Leu Trp Trp 






275 










280 






285 


Ala Phe 


Ala 


Thr Ala 


Gly 


Phe 


Asn 


Gin 


Val 


Leu Asn Tyr Val Gin 






290 










295 






300 


He Leu 


Trp 


Asp Tyr 


Lys 


Ala 


Pro 


Ser Gin Asp Ser 


Ser He Tyr 




305 










310 






315 
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Asn 




Ala 


vai 


Glu 


Ala 


He 


Ala Thr Phe Gly Gly Ala Val Ala 










320 






325 




Ala 


fne 


Ala 


vax 


Gly Tyr Val 


Lys Val Asn Trp Asp Leu Leu 


pi,. 
GJ.y 










335 






340 




Glu 


Leu 


Ala 


Leu 


Val 


Val 


Phe 


Ser Val val Asn Aia uiy ber 


Leu 










350 






355 


360 


Pne 


Leu 


Met 


His 


Tyr Thr Ala 


Asn He Trp Ala Cys Tyr Ala 


Gly 










365 






370 


375 


Tyr 


Leu 


He 


Phe 


Lys 


Ser 


Ser 


Tyr Met Leu Leu He Thr He 


Ala 








380 






385 


390 


Val 


Pne 


Gin 


He 


Ala 


Val 


Asn 


Leu Asn Val Glu Arg Tyr Ala 


Leu 










395 






400 


405 


Val 


Phe 


Gly He 


Asn 


Thr 


Phe 


He Ala Leu Val He Gin Thr 


He 










410 






415 


420 


Met 


Thr 


Val 


He 


Val 


Val 


Asp 


Gin Arg Gly Leu Asn Leu Pro Val 










425 




430 


435 


Ser 


lie 


Gin 


Phe 


Leu 


Val 


Tyr 


Gly Ser Tyr Phe Ala Val He 


Ala 










440 






445 


450 


Gly 


He 


Phe 


Leu 


Met 


Arg 


Ser 


Met Tyr He Thr Tyr Ser Thr 


Lys 










455 




460 


465 


Ser 


Gin 


Lys 


Asp 


Val 


Gin 


Ser 


Pro Ala Pro Ser Glu Asn Pro 


Asp 






470 






475 


480 


Val 


Ser 


His 


Pro 


Glu 


Glu 


Glu 


Ser Asn He He Met Ser Thr 


Lys 










485 






490 


495 


Leu 



















<210> 18 

<211> 573 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 4253888CD1 

<400> 18 

Met Thr Asn Ser Lys Gly Arg Ser He Thr Asp Lys Thr Ser Gly 
15 10 15 

Gly Pro Ser Ser Gly Gly Gly Phe Val Asp Trp Thr Leu Arg Leu 
20 25 30 

Asn Thr He Gin Ser Asp Lys Phe Leu Asn Leu Leu Leu Ser Met 
35 40 45 

Val Pro Val He Tyr Gin Lys Asn Gin Glu Asp Arg His Lys Lys 
50 55 60 

Pro Asn Gly He Trp Gin Asp Gly Leu Ser Thr Ala Val Gin Thr 
65 70 75 

Phe Ser Asn Arg Ser Glu Gin His Met Glu Tyr His Ser Phe Ser 
80 85 90 

Glu Gin Ser Phe His Ala Asn Asn Gly His Ala Ser Ser Ser Cys 
95 100 105 

Ser Gin Lys Tyr Asp Asp Tyr Ala Asn Tyr Asn Tyr Cys Asp Gly 
110 115 120 

Arg Glu Thr Ser Glu Thr Thr Ala Met Leu Gin Asp Glu Asp He 
125 130 135 

Ser Ser Asp Gly Asp Glu Asp Ala He Val Glu Val Thr Pro Lys 
140 145 150 

Leu Pro Lys Glu Ser Ser Gly He Met Ala Leu Gin He Leu Val 
155 160 165 

Pro Phe Leu Leu Ala Gly Phe Gly Thr Val Ser Ala Gly Met Val 
170 175 180 

Leu Asp He Val Gin His Trp Glu Val Phe Arg Lys Val Thr Glu 
185 190 195 
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Val Phe lie Leu Val Pro Ala Leu Leu Gly Leu Lys Gly Asn Leu 

200 205 210 

Glu Met Thr Leu Ala Ser Arg Leu Ser Thr Ala Val Asn lie Gly 

215 220 225 

Lys Met Asp Ser Pro He Glu Lys Trp Asn Leu He He Gly Asn 

230 235 240 

Leu Ala Leu Lys Gin Val Gin Ala Thr Val Val Gly Phe Leu Ala 

245 250 255 

Ala Val Ala Ala He He Leu Gly Trp He Pro Glu Gly Lys Tyr 

260 265 270 

Tyr Leu Asp His Ser He Leu Leu Cys Ser Ser Ser Val Ala Thr 

275 280 285 

Ala Phe He Ala Ser Leu Leu Gin Gly He He Met Val Gly Val 

290 295 300 

He Val Gly Ser Lys Lys Thr Gly He Asn Pro Asp Asn Val Ala 

305 310 315 

Thr Pro He Ala Ala Ser Phe Gly Asp Leu He Thr Leu Ala He 

320 325 330 

Leu Ala Trp He Ser Gin Gly Leu Tyr Ser Cys Leu Glu Thr Tyr 

335 340 345 

Tyr Tyr He Ser Pro Leu Val Gly Val Phe Phe Leu Ala Leu Thr 

350 355 360 

Pro He Trp He He He Ala Ala Lys His Pro Ala Thr Arg Thr 

365 370 375 

Val Leu His Ser Gly Trp Glu Pro Val He Thr Ala Met Val He 

380 385 390 

Ser Ser He Gly Gly Leu He Leu Asp Thr Thr Val Ser Asp Pro 

395 400 405 

Asn Leu Val Gly He Val Val Tyr Thr Pro Val He Asn Gly He 

410 415 420 

Gly Gly Asn Leu Val Ala He Gin Ala Ser Arg lie Ser Thr Tyr 

425 430 435 

Leu His Leu His Ser He Pro Gly Glu Leu Pro Asp Glu Pro Lys 

440 445 450 

Gly Cys Tyr Tyr Pro Phe Arg Thr Phe Phe Gly Pro Gly Val Asn 

455 460 465 

Asn Lys Ser Ala Gin Val Leu Leu Leu Leu Val He Pro Gly His 

470 475 480 

Leu He Phe Leu Tyr Thr He His Leu Met Lys Ser Gly His Thr 

485 490 495 

Ser Leu Thr He He Phe He Val Val Tyr Leu Phe Gly Ala Val 

500 505 510 

Leu Gin Val Phe Thr Leu Leu Trp He Ala Asp Trp Met Val His 

515 520 525 

His Phe Trp Arg Lys Gly Lys Asp Pro Asp Ser Phe Ser He Pro 

530 535 540 

Tyr Leu Thr Ala Leu Gly Asp Leu Leu Gly Thr Ala Leu Leu Ala 

545 550 555 

Leu Ser Phe His Phe Leu Trp Leu He Gly Asp Arg Asp Gly Asp 

560 565 570 

Val Gly Asp 



<210> 19 
<211> 573 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7479974CD1 

<400> 19 
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Met Asp Ala Val Lys Tyr Leu Asn Lys Leu Asn Leu Asp Asn lie 
1 5 10 15 

Glu Leu Thr Lys Tyr Leu Phe Phe Thr Gly Lys Gly Gly Val Gly 
20 25 30 

Lys Thr Thr He Ser Ser Phe lie Ala Leu Asn Leu Ala Glu Asn 
35 40 45 

Gly Lys Lys Val Ala Leu Val Ser Thr Asp Pro Ala Ser Asn Leu 
50 55 60 

Gin Asp Val Phe Gin Met Glu Leu Ser Asn Lys Leu Thr Lys Tyr 
65 70 75 

Gin Pro He Pro Asn Leu Ser He Ala Asn Phe Asp Pro He Val 
80 85 90 

Ala Ala Asp Asp Tyr Lys Ala Gin Ser He Glu Pro Tyr Glu Gly 
95 100 105 

He Leu Pro Glu Asp Val Leu Ala Glu Met Lys Glu Gin Leu Ser 

110 115 120 

Gly Ser Cys Thr Val Glu Val Ala Ala Phe Asn Glu Phe Thr Asn 

125 130 135 

Phe Leu Ser Asp Lys Thr Leu Glu Gin Glu Phe Asp Phe He He 

140 145 150 

Phe Asp Thr Ala Pro Thr Gly His Thr Leu Arg Met Leu Glu Leu 

155 160 165 

Pro Ser Ala Trp Thr Asp Tyr Leu Asn Thr Thr Ser Asn Asp Ala 

170 175 180 

Ser Cys Leu Gly Gin Leu Ser Gly Leu Asn Glu Asn Arg Val Lys 

185 190 195 

Tyr Asn Ser Ala Leu Glu Lys Leu Arg Asn Gin Asp Asp Thr Thr 

200 205 210 

Met Met Leu Val Ala Arg Pro Thr His Ser Ser He Tyr Glu He 

215 220 225 

Gin Arg Ala Gin Gin Glu Leu Gin Gin Leu Ser He Ser Lys Phe 

230 235 240 

Lys Val He He Asn Asn Tyr He Glu Glu Ser His Gly Leu He 

245 250 255 

Ser Ser Gin Met Lys Ser Glu Gin Asp Lys Asn He Asn His Phe 

260 265 270 

Thr Glu Trp Leu Asn Asn Asn His Ala Tyr Tyr Val Pro Tyr Lys 

275 280 285 

Lys Gin Lys Glu Glu Gly He Glu Asn Leu Thr Asn Leu Leu Asn 

290 295 300 

Asp Asp Asn Leu He Glu Asn Asp Asp Phe He Val Glu Asp His 

305 310 315 

Pro Gin Phe Asn Lys Leu He Asp Glu He Glu Asn Ser Lys Val 

320 325 330 

Gin Tyr Leu Phe Thr Met Gly Lys Gly Gly Val Gly Lys Thr Thr 

335 340 345 

Val Ala Thr Gin Leu Ala Thr Ala Leu Ser Asn Lys Gly Tyr Arg 

350 355 360 

Val Leu Leu Ala Thr Thr Asp Pro Thr Lys Glu He Asn Val Glu 

365 370 375 

Thr Thr Ser Asn Leu Asn Thr Ala Tyr He Asp Glu Glu Gin Ala 

380 385 390 

Leu Glu Lys Tyr Lys Lys Glu Val Leu Ala Thr Val Asn Asp Asp 

395 400 405 

Thr Pro Gin Asp Asp He Asp Tyr He Met Glu Asp Leu Lys Ser 

410 415 420 

Pro Cys Thr Glu Glu He Ala Phe Phe Lys Ala Phe Ser Asp He 

425 430 435 

Met Glu Asn Gin Asp Asp Met Asp Tyr Val lie Val Asp Thr Ala 

440 445 450 

Pro Thr Gly His Thr Leu Leu Leu Leu Asp Ser Ser Glu Asn His 

455 460 465 

His Arg Glu Leu Lys Lys Lys Ser Thr Gin Thr Thr Ser Asn Val 
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470 






Glu Thr 


Leu 


Leu 


Pro 

485 


Lys 


He Gin 


He He 


Val 


Thr 


Leu 
bOO 


Ala 


Glu Lys 


Axg Leu 


Val 


11 


Asp 
515 


Leu 


/toll niy 


Trp Val 


Val 


Asn 


Gin 
530 


Ser 


Leu Val 


Leu Phe 


Ser 


Asn 


Lys 
545 


Lys 


Glu Asp 


He Lys 


Asn 


Glu 


Ser 
560 


Phe 


Asp Asn 


Gly Leu 


Ser 













475 


480 


Asn 


Lys 


Asn Leu Thr Gin Met 




490 


495 


Thr 


Pro 


Tyr Leu Glu Ser Lys 




505 


510 


Ala 


Asn 


He Gly His Asn Trp 




520 


525 


Thr Leu Asn Gin Arg Asp Asp 




535 


540 


Glu 


Ser 


Phe Trp He Asn Lys 




550 


555 


Tyr 


Phe 


Val He Pro Tyr Gly 




565 


570 



<210> 20 
<211> 248 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 7483850CD1 

<400> 20 

Met Asn Ser Asn Leu Asn Leu Asp Gly Phe Leu Leu Pro He Ala 
15 10 15 

Val Met Asn Ala He Ser Ser Leu Pro Leu Leu He Leu Ala Pro 
20 25 30 

Phe Leu Glu Tyr Phe Ser Thr Cys Leu Phe Pro Ser Lys Arg Val 
35 40 45 

Gly Ser Phe Leu Ser Thr Cys He He Ala Gly Asn Leu Phe Ala 
50 55 60 

Ala Leu Ser Val Met He Ala Gly Phe Phe Glu He His Arg Lys 
65 70 75 

His Phe Pro Ala Val Glu Gin Pro Leu Ser Gly Lys Val Leu Thr 
80 85 90 

Val Ser Ser Met Pro Cys Phe Tyr Leu He Leu Gin Tyr Val Leu 
95 100 105 

Leu Gly Val Ala Glu Thr Leu Val Asn Pro Ala Leu Ser Val He 
110 115 120 

Ser Tyr Arg Phe Val Pro Ser Asn Val Arg Gly Thr Ser Met Asn 
125 130 135 

Phe Leu Thr Leu Phe Asn Gly Phe Gly Cys Phe Thr Gly Ala Leu 
140 145 150 

Leu Val Lys Leu Val Tyr Leu He Ser Glu Gly Lys Asn Arg Gin 
155 160 165 

Trp Phe Pro Asn Thr Leu Asn Lys Gly Asn Leu Glu Gly Phe Phe 
170 175 180 

Phe Phe Leu Ala Ser Leu Thr Leu Leu Asn Val Leu Gly Phe Cys 
185 190 195 

Ser Val Ser Gin Arg Tyr Cys Asn Leu Asn His Phe Asn Ala Gin 
200 205 210 

Asn He Arg Gly Ser Asn Leu Glu Glu Thr Leu Leu Leu His Glu 
215 220 225 

Lys Ser Leu Lys Phe Tyr Gly Ser He Gin Glu Phe Ser Ser Ser 
230 235 240 

He Asp Leu Trp Glu Thr Ala Leu 
245 

<210> 21 
<211> 761 
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<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 5508353CD1 



<400> 21 

Met Lys Pro Ala Ser Pro Val Glu Glu Glu Val Ser Gin Val Cys 
15 10 15 

Glu Ser Pro Gin Cys Ser Ser Ser Ser Ala Cys Cys Thr Glu Thr 
20 25 30 

Glu Lys Gin His Gly Asp Ala Gly Leu Leu Asn Gly Lys Ala Glu 
35 40 45 

Ser Leu Pro Gly Gin Pro Leu Ala Cys Asn Leu Cys Tyr Glu Ala 
50 55 60 

Glu Ser Pro Asp Glu Ala Ala Leu Val Tyr Ala Ala Arg Ala Tyr 
65 70 75 

Gin Cys Thr Leu Arg Ser Arg Thr Pro Glu Gin Val Met Val Asp 
80 85 90 

Phe Ala Ala Leu Gly Pro Leu Thr Phe Gin Leu Leu His He Leu 
95 100 105 

Pro Phe Asp Ser Val Arg Lys Arg Met Ser Val Val Val Arg His 

110 115 120 

Pro Leu Ser Asn Gin Val Val Val Tyr Thr Lys Gly Ala Asp Ser 

125 130 135 

Val He Met Glu Leu Leu Ser Val Ala Ser Pro Asp Gly Ala Ser 

140 145 150 

Leu Glu Lys Gin Gin Met He Val Arg Glu Lys Thr Gin Lys His 

155 160 165 

Leu Asp Asp Tyr Ala Lys Gin Gly Leu Arg Thr Leu Cys He Ala 

170 175 180 

Lys Lys Val Met Ser Asp Thr Glu Tyr Ala Glu Trp Leu Arg Asn 

185 190 195 

His Phe Leu Ala Glu Thr Ser He Asp Asn Arg Glu Glu Leu Leu 

200 205 210 

Leu Glu Ser Ala Met Arg Leu Glu Asn Lys Leu Thr Leu Leu Gly 

215 220 225 

Ala Thr Gly He Glu Asp Arg Leu Gin Glu Gly Val Pro Glu Ser 

230 235 240 

He Glu Ala Leu His Lys Ala Gly He Lys He Trp Met Leu Thr 

245 250 255 

Gly Asp Lys Gin Glu Thr Ala Val Asn He Ala Tyr Ala Cys Lys 

260 265 270 

Leu Leu Glu Pro Asp Asp Lys Leu Phe He Leu Asn Thr Gin Ser 

275 280 285 

Lys Asp Ala Cys Gly Met Leu Met Ser Thr He Leu Lys Glu Leu 

290 295 300 

Gin Lys Lys Thr Gin Ala Leu Pro Glu Gin Val Ser Leu Ser Glu 

305 310 315 

Asp Leu Leu Gin Pro Pro Val Pro Arg Asp Ser Gly Leu Arg Ala 

320 325 330 

Gly Leu He He Thr Gly Lys Thr Leu Glu Phe Ala Leu Gin Glu 

335 340 345 

Ser Leu Gin Lys Gin Phe Leu Glu Leu Thr Ser Trp Gly Gin Ala 

350 355 360 

Val Val Cys Cys Arg Ala Thr Pro Leu Gin Lys Ser Glu Val Val 

365 370 375 

Lys Leu Val Arg Ser His Leu Gin Val Met Thr Leu Ala He Gly 

380 385 390 

Asp Gly Ala Asn Asp Val Ser Met He Gin Val Ala Asp He Gly 

395 400 405 

He Gly Val Ser Gly Gin Glu Gly Met Gin Ala Val Met Ala Ser 
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410 415 420 

Asp Phe Ala Val Ser Gin Phe Lys His Leu Ser Lys Leu Leu Leu 

425 430 435 

Val His Gly His Trp Cys Tyr Thr Arg Leu Ser Asn Met He Leu 

440 445 450 

Tyr Phe Phe Tyr Lys Asn Val Ala Tyr Val Asn Leu Leu Phe Trp 

455 460 465 

Tyr Gin Phe Phe Cys Gly Phe Ser Gly Thr Ser Met Thr Asp Tyr 

470 475 480 

Trp Val Leu He Phe Phe Asn Leu Leu Phe Thr Ser Ala Pro Pro 

485 490 495 

Val He Tyr Gly Val Leu Glu Lys Asp Val Ser Ala Glu Thr Leu 

500 505 510 

Met Gin Leu Pro Glu Leu Tyr Lys Ser Gly Gin Lys Ser Glu Ala 

515 520 525 

Tyr Leu Pro His Thr Phe Trp He Thr Leu Leu Asp Ala Phe Tyr 

530 535 540 

Gin Ser Leu Val Cys Phe Phe Val Pro Tyr Phe Thr Tyr Gin Gly 

545 550 555 

Ser Asp Thr Asp He Phe Ala Phe Gly Asn Pro Leu Asn Thr Ala 

560 565 570 

Ala Leu Phe He Val Leu Leu His Leu Val He Glu Ser Lys Ser 

575 580 585 

Leu Thr Trp He His Leu Leu Val He He Gly Ser He Leu Ser 

590 595 600 

Tyr Phe Leu Phe Ala He Val Phe Gly Ala Met Cys Val Thr Cys 

605 610 615 

Asn Pro Pro Ser Asn Pro Tyr Trp He Met Gin Glu His Met Leu 

620 625 630 

Asp Pro Val Phe Tyr Leu Val Cys He Leu Thr Thr Ser He Ala 

635 640 645 

Leu Leu Pro Arg Phe Val Tyr Arg Val Leu Gin Gly Ser Leu Phe 

650 655 660 

Pro Ser Pro He Leu Arg Ala Lys His Phe Asp Arg Leu Thr Pro 

665 670 675 

Glu Glu Arg Thr Lys Ala Leu Lys Lys Trp Arg Gly Ala Gly Lys 

680 685 690 

Met Asn Gin Val Thr Ser Lys Tyr Ala Asn Gin Ser Ala Gly Lys 

695 700 705 

Ser Gly Arg Arg Pro Met Pro Gly Pro Ser Ala Val Phe Ala Met 

710 715 720 

Lys Ser Ala Ser Ser Cys Ala He Glu Gin Gly Asn Leu Ser Leu 

725 730 735 

Cys Glu Thr Ala Leu Asp Gin Gly Tyr Ser Glu Thr Lys Ala Phe 

740 745 750 

Glu Met Ala Gly Pro Ser Lys Gly Lys Glu Ser 

755 760 



<210> 22 

<211> 219 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 8543628CD1 



<400> 22 

Met Ala Ser Ser Gly Leu Glu Leu Leu Trp Val Ser Leu Pro Gin 

15 10 15 

Leu Gly Lys Gly Ala Ala Gin Thr Leu Ser He Ser Phe Leu Ser 

20 25 30 

He Gly Phe Ser Thr Val Gly Gly Val Leu Tyr Gly Val Leu Arg 
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35 

Thr Leu Asn Asn Lys Ala 
50 

Glu Leu Phe Arg Ala lie 
65 

Phe Phe Gly Val Pro lie 
80 

Trp Cys Ala Val Leu Val 
95 

Gly Glu Val Val Arg Gly 
110 

Arg Glu Ala Gly Leu Ser 
125 

Gly Tyr Val Leu Leu Pro 
140 

Thr lie Asn Val Tyr Thr 
155 

Val Leu lie Gly Val Val 
170 

He Glu Arg Thr Tyr Glu 
185 

Leu Phe Phe Phe Phe He 
200 

Leu Leu Glu Arg Arg Trp 
215 

<210> 23 
<211> 463 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 7482754CD1 

<400> 23 



Met 


Glu 


Gly 


Gin 


Thr 


Pro 


Gly 


Ser Arg Gly Leu Pro Glu Lys Pro 


1 






5 






10 15 


His 


Pro 


Ala 


Thr 


Ala 


Ala 


Ala 


Thr Leu Ser Ser Met Gly Ala Val 










20 






25 30 


Phe 


He 


Leu 


Met 


Lys 


Ser 


Ala 


Leu Gly Ala Gly Leu Leu Asn Phe 










35 






40 45 


Pro 


Trp 


Ala 


Phe 


Ser 


Lys 


Ala 


Gly Gly Val Val Pro Ala Phe Leu 








50 






55 60 


Val 


Glu 


Leu 


Val 


Ser 


Leu 


Val 


Phe Leu He Ser Gly Leu Val He 










65 






70 75 


Leu 


Gly 


Tyr 


Ala 


Ala 


Ala 


Val 


Ser Gly Gin Ala Thr Tyr Gin Gly 










80 






85 90 


Val 


Val 


Arg 


Gly 


Leu 


Cys 


Gly 


Pro Ala He Gly Lys Leu Cys Glu 










95 






100 105 


Ala 


Cys 


Phe 


Leu 


Leu 


Asn 


Leu 


Leu Met He Ser Val Ala Phe Leu 








110 






115 120 


Arg 


Val 


He 


Gly 


Asp Gin 


Leu 


Glu Lys Leu Cys Asp Ser Leu Leu 










125 






130 135 


Ser 


Gly 


Thr 


Pro 


Pro 


Ala 


Pro 


Gin Pro Trp Tyr Ala Asp Gin Arg 








140 






145 150- 


Phe 


Thr 


Leu 


Pro 


Leu 


Leu 


Ser 


Val Leu Val He Leu Pro Leu Ser 










155 






160 165 


Ala 


Pro 


Arg 


Glu 


He 


Ala 


Phe 


Gin Lys Tyr Thr Ser Pro Ser His 








170 






175 180 


Gly 


His 


Cys 


Val 


Ser 


He 


Leu 


Gly Thr Leu Ala Ala Cys Tyr Leu 










185 






190 195 


Ala 


Leu 


Val 


He 


Thr Val 


Gin 


Tyr Tyr Leu Trp Pro Gin Gly Leu 



He Asn 
Pro Val 
Phe Phe 
Leu Ser 
Ala Leu 
He Gly 
Gin Ala 
Arg He 
Asp Val 
Ser Val 
Cys Tyr 
Ala Gin 



40 

Gly Val 
55 

Leu Val 
70 

Gly Leu 
85 

Leu Trp 

100 
His Ser 

115 
Leu Ser 

130 
Leu Arg 

145 
He Lys 

160 
He Lys 

175 
Leu He 

190 
Pro Leu 

205 

Ala 



Leu Arg Val 
Trp Leu Tyr 
Ser He Pro 
Gly Ala Ser 
Leu Pro Arg 
Asp Leu Gin 
Arg Met Thr 
Thr Ser Ser 
Val Gly Gin 
Tyr Gly Ala 
Ser Ala Ala 



45 

Tyr Leu 
60 

Leu Leu 
75 

Ser Phe 
90 

Glu Val 

105 
Gly Gin 

120 
Leu Tyr 

135 
Pro Pro 

150 
Leu Ala 

165 
Gin He 

180 
Leu Phe 

195 
Ser Lys 

210 
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200 205 210 

Val Arg Glu Ser His Pro Ser Leu Ser Pro Ala Ser Trp Thr Ser 

215 220 225 

Val Phe Ser Val Phe Pro Thr He Cys Phe Gly Phe Gin Cys His 

230 235 240 

Glu Ala Ala Val Ser He Tyr Cys Ser Met Arg Lys Arg Ser Leu 

245 250 255 

Ser His Trp Ala Leu Val Ser Val Leu Ser Leu Leu Ala Cys Cys 

260 265 270 

Leu He Tyr Ser Leu Thr Gly Val Tyr Gly Phe Leu Thr Phe Gly 

275 280 285 

Thr Glu Val Ser Ala Asp Val Leu Met Ser Tyr Pro Gly Asn Asp 

290 295 300 

Met Val He He Val Ala Arg Val Leu Phe Ala Val Ser He Val 

305 310 315 

Thr Val Tyr Pro He Val Leu Phe Leu Gly Arg Ser Val Met Gin 

320 325 330 

Asp Phe Trp Arg Arg Ser Cys Leu Gly Gly Trp Gly Pro Ser Ala 

335 340 345 

Leu Ala Asp Pro Ser Gly Leu Trp Val Arg Met Pro Leu Thr He 

350 355 360 

Leu Trp Val Thr Val Thr Leu Ala Met Ala Leu Phe Met Pro Asp 

365 370 375 

Leu Ser Glu He Val Ser He He Gly Gly He Ser Ser Phe Phe 

380 385 390 

He Phe He Phe Pro Gly Pro Gly Gly Lys His Arg Phe Cys Glu 

395 400 405 

Gly Trp Phe Leu Arg Lys Pro Ser Gin Thr Ala Gin Ala Leu Ser 

410 415 420 

Glu Lys Gly Lys Pro Ala Arg Phe Val Pro His Leu Cys Asn Gly 

425 430 435 

Cys Arg Ala Tyr Arg Thr Lys Ser Gin Val Arg Asn Leu Arg Ala 

440 445 450 

Thr Gly Arg Glu Ser Leu Ser Thr Gin Tyr Glu Gly He 

455 460 

<210> 24 

<211> 1043 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3794818CD1 

<400> 24 

Met Glu Phe Val Arg Ala Leu Trp Leu Gly Leu Ala Leu Ala Leu 
15 10 15 

Gly Pro Gly Ser Ala Gly Gly His Pro Gin Pro Cys Gly Val Leu 

20 25 30 

Ala Arg Leu Gly Gly Ser Val Arg Leu Gly Ala Leu Leu Pro Arg 

35 40 45 

Ala Pro Leu Ala Arg Ala Arg Ala Arg Ala Ala Leu Ala Arg Ala 

50 55 60 

Ala Leu Ala Pro Arg Leu Pro His Asn Leu Ser Leu Glu Leu Val 

65 70 75 

Val Ala Ala Pro Pro Ala Arg Asp Pro Ala Ser Leu Thr Arg Gly 

80 85 90 

Leu Cys Gin Ala Leu Val Pro Pro Gly Val Ala Ala Leu Leu Ala 

95 100 105 

Phe Pro Glu Ala Arg Pro Glu Leu Leu Gin Leu His Phe Leu Ala 
110 115 120 

Ala Ala Thr Glu Thr Pro Val Leu Ser Leu Leu Arg Arg Glu Ala 
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125 130 135 

Arg Ala Pro Leu Gly Ala Pro Asn Pro Phe His Leu Gin Leu His 

140 145 150 

Trp Ala Ser Pro Leu Glu Thr Leu Leu Asp Val Leu Val Ala Val 

155 160 165 

Leu Gin Ala His Ala Trp Glu Asp Val Gly Leu Ala Leu Cys Arg 

170 175 180 

Thr Gin Asp Pro Gly Gly Leu Val Ala Leu Trp Thr Ser Arg Ala 

185 190 195 

Gly Arg Pro Pro Gin Leu Val Leu Asp Leu Ser Arg Arg Asp Thr 

200 205 210 

Gly Asp Ala Gly Leu Arg Ala Arg Leu Ala Pro Met Ala Ala Pro 

215 220 225 

Val Gly Gly Glu Ala Pro Val Pro Ala Ala Val Leu Leu Gly Cys 

230 235 240 

Asp He Ala Arg Ala Arg Arg Val Leu Glu Ala Val Pro Pro Gly 

245 250 255 

Pro His Trp Leu Leu Gly Thr Pro Leu Pro Pro Lys Ala Leu Pro 

260 265 270 

Thr Ala Gly Leu Pro Pro Gly Leu Leu Ala Leu Gly Glu Val Ala 

275 280 285 

Arg Pro Pro Leu Glu Ala Ala He His Asp He Val Gin Leu Val 

290 295 300 

Ala Arg Ala Leu Gly Ser Ala Ala Gin Val Gin Pro Lys Arg Ala 

305 310 315 

Leu Leu Pro Ala Pro Val Asn Cys Gly Asp Leu Gin Pro Ala Gly 

320 325 330 

Pro Glu Ser Pro Gly Arg Phe Leu Ala Arg Phe Leu Ala Asn Thr 

335 340 345 

Ser Phe Gin Gly Arg Thr Gly Pro Val Trp Val Thr Gly Ser Ser 

350 355 360 

Gin Val His Met Ser Arg His Phe Lys Val Trp Ser Leu Arg Arg 

365 370 375 

Asp Pro Arg Gly Ala Pro Ala Trp Ala Thr Val Gly Ser Trp Arg 

380 385 390 

Asp Gly Gin Leu Asp Leu Glu Pro Gly Gly Ala Ser Ala Arg Pro 

395 400 405 

Pro Pro Pro Gin Gly Ala Gin Val Trp Pro Lys Leu Arg Val Val 

410 415 420 

Thr Leu Leu Glu His Pro Phe Val Phe Ala Arg Asp Pro Asp Glu 

425 430 435 

Asp Gly Gin Cys Pro Ala Gly Gin Leu Cys Leu Asp Pro Gly Thr 

440 445 450 

Asn Asp Ser Ala Thr Leu Asp Ala Leu Phe Ala Ala Leu Ala Asn 

455 460 465 

Gly Ser Ala Pro Arg Ala Leu Arg Lys Cys Cys Tyr Gly Tyr Cys 

470 475 480 

He Asp Leu Leu Glu Arg Leu Ala Glu Asp Thr Pro Phe Asp Phe 

485 490 495 

Glu Leu Tyr Leu Val Gly Asp Gly Lys Tyr Gly Ala Leu Arg Asp 

500 505 510 

Gly Arg Trp Thr Gly Leu Val Gly Asp Leu Leu Ala Gly Arg Ala 

515 520 525 

His Met Ala Val Thr Ser Phe Ser He Asn Ser Ala Arg Ser Gin 

530 535 540 

Val Val Asp Phe Thr Ser Pro Phe Phe Ser Thr Ser Leu Gly He 

545 550 555 

Met Val Arg Ala Arg Asp Thr Ala Ser Pro He Gly Ala Phe Met 

560 565 570 

Trp Pro Leu His Trp Ser Thr Trp Leu Gly Val Phe Ala Ala Leu 

575 580 585 

His Leu Thr Ala Leu Phe Leu Thr Val Tyr Glu Trp Arg Ser Pro 

590 595 600 
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Tyr 


Gly 


Leu 


Thr 


Pro 
605 


Arg 


Gly 


Tyr 


Ser 


Ser 


Ala 


Leu 
620 


Asn 


Leu 


Thr 


Val 


Ser 


Ser 


Lys 
635 


Thr 


Pro 


Met 


Asn 


Leu 


Trp 


Ala 
650 


He 


Phe 


Thr 


Ala 


Asn 


Leu 


Ala 
665 


Ala 


Val 


Glu 


Leu 


Ser 


Gly 


He 
680 


His 


Asp 


Gly 


Phe 


Arg 


Phe 


Gly 
695 


Thr 


Val 


He 


Lys 


Lys 


Ser 


Phe 
710 


Pro 


Asp 


Ser 


Ala 


Pro 


Thr 


Thr 
725 


Pro 


Arg 


Pro 


Pro 


Lys 


Leu 


Asn 
740 


Ala 


Phe 


Tyr 


Glu 


Val 


Ser 


He 
755 


Asp 


Ala 


Lys 


Pro 


Phe 


Ala 


He 
770 


Glu 


Gly 


Ser 


Pro 


Leu 


Thr 


Ser 
785 


Asn 


Leu 


Ser 


Ser 


Gly 


Phe 


He 
800 


Asp 


Leu 


Val 


Pro 


Cys 


Gly 


Lys 
815 


Arg 


Val 


Met 


Ser 


He 


Tyr 


His 
830 


Phe 


Ala 


Gly 


Leu 


Gly 


Ser 


Ala 
845 


Leu 


Leu 


Phe 


Arg 


Leu 


Ala 


Leu 
860 


Pro 


Arg 


Tyr 


Trp 


Leu 


His 


Thr 
875 


Ser 


Gin 


Glu 


Pro 


Pro 


Glu 


Gly 
890 


Ser 


Lys 


Ser 


Gly 


Pro 


Glu 


Val 
905 


Glu 


Gin 


Thr 


Ala 


Pro 


Glu 


Gly 
920 


Trp 


Lys 


Glu 


Arg 


Arg 


Val 


Arg 
935 


Phe 


Leu 


Pro 


Glu 


Ala 


Asp 


Ala 
950 


Glu 


Ala 


Val 


Trp 


Leu 


Cys 


Ser 
965 


Tyr 


Gly 


Gly 


Ala 


Pro 


Gin 


Pro 
980 


Gly 


Glu 


Glu 


Val 


Ala 


Arg 


Glu 
995 


Arg 


Leu 


Gin 


Leu 


Leu 


Ala 


Gin 


Leu 


Gly 








1010 






Arg 


Leu 


Leu 


Gin Ala 
1025 


Arg 


Ala 


Ser 


Gly 


Arg 


Pro Gly 
1040 


Ser 


Gin 



<210> 25 
<211> 480 



Arg 


Asn 


Arg 


Ser Thr 


Val Phe 


Ser 






610 






615 


Cys 


Tyr 


Ala 


He Leu 


Phe Arg 


Arg 






625 






630 


Lys 


Cys 


Pro 


Thr Gly 


Arg Leu 


Leu 






640 






645 


Cys 


Leu 


Leu 


Val Leu 


Ser Ser 


Tyr 






655 






660 


Met 


Val 


Gly 


Asp Lys 


Thr Phe 


Glu 






670 






675 


Pro 


Lys 


Leu 


His His 


Pro Ala 


Gin 






685 






690 


Trp 


Glu 


Ser 


Ser Ala 


Glu Ala 


Tyr 






700 






705 


Met 


His 


Ala 


His Met 


Arg Arg His 






715 






720 


Gly 


Val 


Ala 


Met Leu 


Thr Ser 


Asp 






730 






735 


He 


Met 


Asp 


Lys Ser 


Leu Leu 


Asp 






745 






750 


Asp 


Cys 


Lys 


Leu Leu 


Thr Val 


Gly 






760 






765 


Tyr 


Gly 


He 


Gly Leu 


Pro Gin 


Asn 






775 






780 


Ser 


Glu 


Phe 


He Ser 


Arg Tyr 


Lys 






790 






795 


Leu 


His 


Asp 


Lys Trp 


Tyr Lys 


Met 






805 






810 


Phe 


Ala 


Val 


Thr Glu 


Thr Leu 


Gin 






820 






825 


Gly 


Leu 


Phe 


Val Leu 


Leu Cys 


Leu 






835 






840 


Ser 


Ser 


Leu 


Gly Glu 


His Ala 


Phe 






850 






855 


He 


Arg 


Lys 


Gly Ser 


Arg Leu 


Gin 






865 






870 


Lys 


He 


His 


Arg Ala 


Leu Asn 


Thr 






880 






885 


Glu 


Glu 


Thr 


Ala Glu 


Ala Glu 


Pro 






895 






900 


Gin 


Gin 


Gin 


Gin Gin 


Asp Gin 


Pro 






910 






915 


Arg 


Ala 


Arg 


Arg Ala 


Val Asp 


Lys 






925 






930 


Leu 


Glu 


Pro 


Ala Val 


Val Val 


Ala 






940 






945 


Glu 


Ala 


Ala 


Pro Arg 


Glu Gly 


Pro 






955 






960 


Arg 


Pro 


Pro 


Ala Ala 


Arg Pro 


Thr 






970 






975 


Leu 


Gin 


Glu 


Leu Glu 


Arg Arg 


He 






985 






990 


Arg 


Gin 


Ala 


Leu Val 


Arg Arg Gly 




1000 




1005 


Asp 


Ser 


Ala 


Arg His 


Arg Pro Arg 




1015 




1020 


Ala 


Pro 


Ala 


Glu Ala 


Pro Pro 


His 




1030 




1035 


Glu 
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<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 4717525CD1 



<400> 25 

Met Arg Gly Ser Pro Gly Asp Ala Glu Arg Arg Gin Arg Trp Gly 
15 10 15 

Arg Leu Phe Glu Glu Leu Asp Ser Asn Lys Asp Gly Arg Val Asp 
20 25 30 

Val His Glu Leu Arg Gin Gly Leu Ala Arg Leu Gly Gly Gly Asn 
35 40 45 

Pro Asp Pro Gly Ala Gin Gin Gly He Ser Ser Glu Gly Asp Ala 
50 55 60 

Asp Pro Asp Gly Gly Leu Asp Leu Glu Glu Phe Ser Arg Tyr Leu 
65 70 75 

Gin Glu Arg Glu Gin Arg Leu Leu Leu Met Phe His Ser Leu Asp 
80 85 90 

Arg Asn Gin Asp Gly His He Asp Val Ser Glu He Gin Gin Ser 
95 100 105 

Phe Arg Ala Leu Gly He Ser He Ser Leu Glu Gin Ala Glu Lys 

110 115 120 

He Leu His Ser Met Asp Arg Asp Gly Thr Met Thr He Asp Trp 

125 130 135 

Gin Glu Trp Arg Asp His Phe Leu Leu His Ser Leu Glu Asn Val 

140 145 150 

Glu Asp Val Leu Tyr Phe Trp Lys His Ser Thr Val Leu Asp He 

155 160 165 

Gly Glu Cys Leu Thr Val Pro Asp Glu Phe Ser Lys Gin Glu Lys 

170 175 180 

Leu Thr Gly Met Trp Trp Lys Gin Leu Val Ala Gly Ala Val Ala 

185 * 190 195 

Gly Ala Val Ser Arg Thr Gly Thr Ala Pro Leu Asp Arg Leu Lys 

200 205 210 

Val Phe Met Gin Val His Ala Ser Lys Thr Asn Arg Leu Asn He 

215 220 225 

Leu Gly Gly Leu Arg Ser Met Val Leu Glu Gly Gly lie Arg Ser 

230 235 240 

Leu Trp Arg Gly Asn Gly He Asn Val Leu Lys He Ala Pro Glu 

245 " 250 255 

Ser Ala He Lys Phe Met Ala Tyr Glu Gin He Lys Arg Ala He 

260 265 270 

Leu Gly Gin Gin Glu Thr Leu His Val Gin Glu Arg Phe Val Ala 

275 280 285 

Gly Ser Leu Ala Gly Ala Thr Ala Gin Thr He He Tyr Pro Met 

290 295 300 

Glu Thr Leu Lys Asn Trp Trp Leu Gin Gin Tyr Ser His Asp Ser 

305 310 315 

Ala Asp Pro Gly He Leu Val Leu Leu Ala Cys Gly Thr He Ser 

320 325 330 

Ser Thr Cys Gly Gin He Ala Ser Tyr Pro Leu Ala Leu Val Arg 

335 340 345 

Thr Arg Met Gin Ala Gin Ala Ser He Glu Gly Gly Pro Gin Leu 

350 355 360 

Ser Met Leu Gly Leu Leu Arg His He Leu Ser Gin Glu Gly Met 

365 370 375 

Arg Gly Leu Tyr Arg Gly He Ala Pro Asn Phe Met Lys Val lie 

380 385 390 

Pro Ala Val Ser lie Ser Tyr Val Val Tyr Glu Asn Met Lys Gin 

395 400 405 

Ala Leu Gly Val Thr Ser Arg Leu Glu Tyr Ser Gly Ser lie Ser 
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410 415 420 

Asp His Cys Asn Leu Cys Leu Pro Gly Ser Ser Asp Ser Pro Ala 

425 430 435 

Ser Ala Ser Arg Val Ala Gly He Thr Gly Phe His His Val Ala 

440 445 ' 450 

Gin Ala His Leu Gly Leu Val Gly Ser Arg Asn Ser Ala Ala Phe 

455 460 465 

Ser Leu Pro Thr Cys Trp Asp Tyr Arg Lys Pro Val Val Met Pro 

470 475 480 



<210> 26 
<211> 518 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5091793CD1 

<400> 26 

Met Ala Gly Leu Arg Asn Glu Ser Glu Gin Glu Pro Leu Leu Gly 
15 10 15 

Asp Thr Pro Gly Ser Arg Glu Trp Asp He Leu Glu Thr Glu Glu 
20 25 30 

His Tyr Lys Ser Arg Trp Arg Ser He Arg He Leu Tyr Leu Thr 
35 40 45 

Met Phe Leu Ser Ser Val Gly Phe Ser Val Val Met Met Ser He 
50 55 60 

Trp Pro Tyr Leu Gin Lys He Asp Pro Thr Ala Asp Thr Ser Phe 
65 70 75 

Leu Gly Trp Val He Ala Ser Tyr Ser Leu Gly Gin Met Val Ala 
80 85 90 

Ser Pro He Phe Gly Leu Trp Ser Asn Tyr Arg Pro Arg Lys Glu 
95 100 105 

Pro Leu He Val Ser He Leu He Ser Val Ala Ala Asn Cys Leu 

110 115 120 

Tyr Ala Tyr Leu His He Pro Ala Ser His Asn Lys Tyr Tyr Met 

125 130 135 

Leu Val Ala Arg Gly Leu Leu Gly He Gly Ala Gly Asn Val Ala 

140 145 150 

Val Val Arg Ser Tyr Thr Ala Gly Ala Thr Ser Leu Gin Glu Arg 

155 160 165 

Thr Ser Ser Met Ala Asn He Ser Met Cys Gin Ala Leu Gly Phe 

170 175 180 

He Leu Gly Pro Val Phe Gin Thr Cys Phe Thr Phe Leu Gly Glu 

185 190 195 

Lys Gly Val Thr Trp Asp Val He Lys Leu Gin He Asn Met Tyr 

200 205 210 

Thr Thr Pro Val Leu Leu Ser Ala Phe Leu Gly He Leu Asn He 

215 220 225 

He Leu He Leu Ala He Leu Arg Glu His Arg Val Asp Asp Ser 

230 235 240 

Gly Arg Gin Cys Lys Ser He Asn Phe Glu Glu Ala Ser Thr Asp 

245 250 255 

Glu Ala Gin Val Pro Gin Gly Asn He Asp Gin Val Ala Val Val 

260 265 270 

Ala He Asn Val Leu Phe Phe Val Thr Leu Phe He Phe Ala Leu 

275 280 285 

Phe Glu Thr He He Thr Pro Leu Thr Met Asp Met Tyr Ala Trp 

290 295 300 

Thr Gin Glu Gin Ala Val Leu Tyr Asn Gly He He Leu Ala Ala 

305 310 315 
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Leu 


Gly 


Val Glu 


Ala 
320 


Val 


Val 


lie 


Ser 


Lys 


Lys He 


Gly 
335 


Glu 


Arg 


Ala 


Val 


Val 


Trp Val 


Gly 
350 


Phe 


Phe 


lie 


Phe 


Pro 


Lys He 


Gin 
365 


Trp 


Glu 


Asp 


Asn 


Thr 


Thr Phe 


Gly 
380 


Glu 


He 


lie 


Met 


Glu 


Asp Asp 


Asn 
395 


Glu 


Arg 


Pro 


Ala 


Trp 


Cys Leu 


Tyr 
410 


Thr 


Pro 


Val 


Thr 


Ser 


Ala Val 


Leu 
425 


lie 


Gly 


Leu 


Met 


Ser 


Tyr Thr 


Leu 
440 


Tyr 


Ser 


Lys 


Gly 


Val 


Tyr Met 


Gly 
455 


Trp 


Leu 


Thr 


He 


Leu 


Gly Pro 


Met 
470 


Phe 


lie 


Ser 


Pro 


Arg 


Trp Ala 


Phe 
485 


Ser 


Leu 


Val 


He 


Thr 


Leu Leu 


Gly 
500 


Val 


Val 


Tyr 


Val 


Arg 


Tyr Gly 


Arg 
515 


He 


Gin 


Glu 



<210> 27 

<211> 501 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_£eature 

<223> Incyte ID No: 5945527CD1 

<400> 27 



Met 


Arg 


Ser 


Ser 


Leu 


Ala 


Pro 


Gly 


1 








5 








Ser 


Arg 


Asp 


Ser 


Trp 
20 


Phe 


Arg 


Gly 


Leu 


lie 


Tyr 


Ala 


Cys 
35 


Tyr 


His 


Met 


Val 


Lys 


Ser 


Arg 


Leu 
50 


His 


Gin 


Asn 


He 


Asn 


Asp 


Thr 


His 
65 


Ser 


Leu 


Asn 


Ala 


Pro 


Phe 


Asp 


Lys 
80 


Asp 


Asn 


Tyr 


Asp 


Asn 


Ala 


Phe 


Leu 
95 


lie 


Ala 


Tyr 


Gly 


Val 


Phe 


Gly 


Glu 
110 


Arg 


Leu 


Pro 


Gly 


Met 


Leu 


Leu 


Ser 
125 


Gly 


Leu 


Phe 


Tyr 


Phe 


Trp 


Asn 


lie 
140 


His 


Glu 


Leu 


Val 


Cys 


Asn 


Gly 


Leu 
155 


Val 


Gin 


Thr 


Thr 


Cys 


Val 


Gly 


Asn 


Trp 


Phe 


Gly 



170 



Phe 


Leu 


Gly Val Lys Leu Leu 




325 


330 


lie 


Leu 


Leu Gly Gly Leu lie 




340 


345 


Leu 


Leu 


Pro Trp Gly Asn Gin 




355 


360 


Leu 


His 


Asn Asn Ser lie Pro 




370 


375 


lie 


Gly 


Leu Trp Lys Ser Pro 




385 


390 


Thr 


Gly 


Cys Ser lie Glu Gin 




400 


405 


lie 


His 


Leu Ala Gin Phe Leu 




415 


420 


Gly 


Tyr 


Pro Val Cys Asn Leu 




430 


435 


lie 


Leu 


Gly Pro Lys Pro Gin 




445 


450 


Ala 


Ser 


Gly Ser Gly Ala Arg 




460 


465 


Gin 


Val 


Tyr Ala His Trp Gly 




475 


480 


Cys 


Gly 


He lie Val Leu Thr 




490 


495 


Lys 


Arg 


Leu lie Ala Leu Ser 




505 


510 



Val 


Trp 


Phe Phe Arg Ala 


Phe 




10 




15 


Leu 


lie 


Leu Leu Leu Thr 


Phe 




25 




30 


Ser 


Arg 


Lys Pro He Ser 


lie 




40 




45 


Cys 


Ser 


Glu Gin He Lys 


Pro 




55 




60 


Asp 


Thr 


Met Trp Cys Ser Trp 




70 




75 


Lys 


Glu 


Leu Leu Gly Gly Val 




85 




90 


Ala 


lie 


Gly Met Phe lie 


Ser 




100 




105 


Leu 


Arg 


Tyr Tyr Leu Ser 


Ala 




115 




120 


Thr 


Ser 


Leu Phe Gly Leu Gly 




130 




135 


Trp 


Tyr 


Phe Val Val lie 


Gin 




145 




150 


Thr 


Gly 


Trp Pro Ser Val 


Val 




160 




165 


Lys 


Gly 


Lys Arg Gly Phe 


lie 




175 




180 



42/83 



Met Gly 


He Trp 


Asn 


Ser 


His 


Thr 






185 








Ser Leu 


He Ala 


Gly 


He 


Trp 


Val 






200 








Phe He 


Val Pro 


Gly 


He 


He 


Thr 






215 








Phe Leu 


Phe Leu 


He 


Glu 


His 


Pro 






230 








Pro Gin 


His His 


Gly 


Glu 


Pro 


Ala 






245 








Asp Pro 


Gly Asn 


Ser 


Pro 


Cys 


Ser 






260 








Thr Val 


Ala Lys 


Cys 


Ser 


Lys 


Gly 






275 








He Ser 


Phe Phe 


Gly 


Ala 


Leu 


Arg 






290 








Ser Leu 


Cys Leu 


Leu 


Phe 


Ala 


Lys 






305 








Tyr Trp 


Leu Pro 


Leu 


Tyr 


He 


Ala 






320 








Lys Glu 


Ala Gly 


Asp 


Leu 


Ser 


Thr 






335 








He Gly 


Gly He 


Val 


Ala 


Gly 


Leu 






350 








Arg Ala 


Thr Thr 


Cys 


Cys 


Val 


Met 






365 








Met Phe 


Leu Tyr 


Asn 


Tyr 


He 


Gly 






380 








He Val 


Met Leu 


He 


He 


Cys 


Gly 






395 








Ala Leu 


He Thr 


Thr 


Ala 


Val 


Ser 






410 








Ser Leu 


Lys Gly 


Asn 


Ala 


Lys 


Ala 






425 








He Asp 


Gly Thr 


Gly 


Ser 


He 


Gly 






440 








Ala Gly 


Leu He 


Ser 


Pro 


Thr 


Gly 






455 








Leu He 


Ser Ala 


Asp 


Val 


Leu 


Ala 






470 








Val Tyr 


Lys Glu 


He 


Leu 


Ala 


Trp 






485 








Ser Gly 


Tyr Lys 


Glu 


He 







500 

<210> 28 
<211> 801 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 6941124CD1 

<400> 28 

Met Gin Ala His Asn Thr Glu Asn 

1 5 
Val Leu Leu His Thr Ser Arg Lys 
20 

Lys Glu Glu He Arg Ala His Arg 
35 

Gin Ala Ser He Val Leu Asn Glu 
50 



Ser 


Val 


Gly Asn He 


Leu 


Gly 




190 






195 


Asn Gly 


Gin Trp Gly 


Leu 


c _ 

ser 




205 






zlU 


Ala 


Val 


Met Gly Val 


Ti- 
ne 


Thr 




220 






*1 1 c 

2 2b 


Glu 


Asp 


Val Asp Cys 


Ala 


Pro 




235 






240 


Glu 


Asn 


Gin Asp Asn 


Pro 


Glu 




250 






255 


He Arg 


Glu Ser Gly 


Leu 


Glu 




265 






270 


Pro Cys 


Glu Glu Pro 


Ala 


Ala 




280 






285 


He 


Pro 


Gly Val Val 


Glu 


Phe 




295 






300 


Leu Val 


Ser Tyr Thr 


Phe 


Leu 




310 






315 


Asn 


Val 


Ala His Phe 


Ser 


Ala 




325 






330 


Leu 


Phe 


Asp Val Gly 


Gly 


He 




340 






345 


Val 


Ser 


Asp Tyr Thr 


Asn Gly 




355 






360 


Leu 


He 


Leu Ala Ala 


Pro 


Met 




370 






375 


Gin 


Asp 


Gly He Ala 


Ser 


Ser 




385 






390 


Gly Leu 


Val Asn Gly 


Pro 


Tyr 




400 






405 


Ala 


Asp 


Leu Gly Thr 


His 


Lys 




415 






420 


Leu 


Ser 


Thr Val Thr 


Ala 


He 




430 






435 


Ala 


Ala 


Leu Gly Pro 


Leu 


Leu 




445 






450 


Trp 


Asn 


Asn Val Phe 


Tyr 


Met 




460 






465 


Cys 


Leu 


Leu Leu Cys 


Arg 


Leu 




475 






480 


Lys 


Val 


Ser Leu Ser 


Arg Gly 


490 






495 



Glu 


Ala 


Thr 


Ser 


Gly Gly Cys 




10 






15 


Tyr 


Leu 


Lys 


Leu 


Lys Asn Phe 




25 






30 


Asp 


Leu 


Asp Gly 


Phe Leu Ala 




40 






45 


Thr 


Ala 


Thr 


Ser 


Leu Asp Asn 




55 






60 
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Val 


Leu 


Arg 


Thr 


Met 


Leu 


Arg 


Arg 


Phe Ala Arg Asp Pro Asp Asn 










65 








70 75 


Asn 


Glu 


Pro 


Asn 


Cys 


Asn 


Leu 


Asp 


Leu Leu Met Ala Met Leu Phe 










80 








85 90 


Thr 


Asp 


Ala 


Gly 


Ala 


Pro 


Met 


Arg 


Gly Lys Val His Leu Leu Ser 










95 








100 105 


Asp 


Thr 


He 


Gin 


Gly 


Val 


Thr 


Ala 


Thr Val Thr Gly Val Arg Tyr 










110 








115 120 


Gin 


Gin 


Ser 


Trp 


Leu 


Cys 


lie 


lie 


Cys Thr Met Lys Ala Leu Gin 










125 








130 135 


Lys 


Arg 


His 


Val 


Cys 


lie 


Ser 


Arg 


Leu Val Arg Pro Gin Asn Trp 










140 








145 150 


Gly 


Glu 


Asn 


Ser 


Cys 


Glu Val 


Arg 


Phe Val He Leu Val Leu Ala 










155 








160 165 


Pro 


Pro 


Lys 


Met 


Lys 


Ser 


Thr 


Lys 


Thr Ala Met Glu Val Ala Arg 










170 








175 180 


Thr 


Phe 


Ala 


Thr 


Met 


Phe 


Ser 


Asp 


lie Ala Phe Arg Gin Lys Leu 










185 








190 195 


Leu 


Glu 


Thr 


Arg 


Thr 


Glu 


Glu 


Glu 


Phe Lys Glu Ala Leu Val His 










200 








205 210 


Gin 


Arg 


Gin 


Leu 


Leu 


Thr 


Met 


Val 


Ser His Gly Pro Val Ala Pro 










215 








220 225 


Arg 


Thr 


Lys 


Glu 


Arg 


Ser 


Thr 


Val 


Ser Leu Pro Ala His Arg His 










230 








235 240 


Pro 


Glu 


Pro 


Pro 


Lys 


Cys 


Lys 


Asp 


Phe Val Pro Phe Gly Lys Gly 










245 








250 255 


He 


Arg 


Glu 


Asp 


He 


Ala 


Arg 


Arg 


Phe Pro Leu Tyr Pro Leu Asp 










260 








265 270 


Phe 


Thr 


Asp 


Gly 


lie 


lie Gly 


Lys 


Asn Lys Ala Val Gly Lys Tyr 










275 








280 285 


He 


Thr 


Thr 


Thr 


Leu 


Phe 


Leu 


Tyr 


Phe Ala Cys Leu Leu Pro Thr 










290 








295 300 


He 


Ala 


Phe 


Gly 


Ser 


Leu 


Asn 


Asp 


Glu Asn Thr Asp Gly Ala lie 










305 








310 315 


Asp 


Val 


Gin 


Lys 


Thr 


lie 


Ala 


Gly 


Gin Ser lie Gly Gly Leu Leu 










320 








325 330 


Tyr 


Ala 


Leu 


Phe 


Ser 


Gly Gin 


Pro 


Leu Val He Leu Leu Thr Thr 








335 








340 345 


Ala 


Pro 


Leu 


Ala 


Leu 


Tyr 


lie 


Gin 


Val lie Arg Val lie Cys Asp 










350 








355 360 


Asp 


Tyr 


Asp 


Leu 


Asp 


Phe 


Asn 


Ser 


Phe Tyr Ala Trp Thr Gly Leu 










365 








370 375 


Trp 


Asn 


Ser 


Phe 


Phe 


Leu 


Ala 


Leu 


Tyr Ala Phe Phe Asn Leu Ser 










380 








385 390 


Leu 


Val 


Met 


Ser 


Leu 


Phe 


Lys 


Arg 


Ser Thr Glu Glu lie lie Ala 










395 








400 405 


Leu 


Phe 


He 


Ser 


lie 


Thr 


Phe 


Val 


Leu Asp Ala Val Lys Gly Thr 










410 








415 420 


Val 


Lys 


He 


Phe 


Trp 


Lys 


Tyr 


Tyr 


Tyr Gly His Tyr Leu Asp Asp 










425 








430 435 


Tyr 


His 


Thr 


Lys 


Arg 


Thr 


Ser 


Ser 


Leu Val Ser Leu Ser Gly Leu 










440 








445 450 


Gly 


Ala 


Ser 


Leu 


Asn 


Ala 


Ser 


Leu 


His Thr Ala Leu Asn Ala Ser 








455 








460 465 


Phe 


Leu 


Ala 


Ser 


Pro 


Thr 


Glu 


Leu 


Pro Ser Ala Thr His Ser Gly 










470 








475 480 


Gin 


Ala 


Thr 


Ala 


Val 


Leu 


Ser 


Leu 


Leu lie Met Leu Gly Thr Leu 










485 








490 495 


Trp 


Leu 


Gly 


Tyr 


Thr 


Leu Tyr 


Gin 


Phe Lys Lys Ser Pro Tyr Leu 










500 








505 510 


His 


Pro 


Cys 


Val 


Arg 


Glu 


He 


Leu 


Ser Asp Cys Ala Leu Pro lie 








515 








520 525 


Ala 


Val 


Leu 


Ala 


Phe 


Ser 


Leu 


lie 


Ser Ser His Gly Phe Arg Glu 
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530 






He Glu 


Met Ser 


Lys Phe 


Arg Tyr 






545 






Ala Met 


Ala Gin 


He Gin 


Ser 


Leu 






560 






Ala Met 


Gly Leu 


Gly Phe 


Leu 


Leu 






575 






Gin Asn 


Leu Val 


Ala Ala 


Leu Val 






590 






Val Lys 


Gly Thr 


Ala Tyr 


His 


Trp 






605 






He Asn 


Thr Gly 


Leu Ser 


Leu 


Phe 






620 






Ala Tyr 


Pro His 


Ser Pro 


Leu 


His 






635 






Glu Glu 


Arg Val 


Glu Asn 


Gly His 






650 






Val Lys 


Glu Thr 


Arg Leu 


Thr 


Ser 






665 






Gly Leu 


Ser Leu 


Leu Leu 


Leu 


Pro 






680 






Lys Pro 


Val Leu 


Tyr Gly 


Leu 


Phe 






695 






Leu Asp 


Gly Asn 


Gin Leu 


Val 


Gin 






710 






Glu Gin 


Thr Ala 


Tyr Pro 


Pro 


Thr 






725 






Gin Arg 


Lys He 


His Tyr 


Phe 


Thr 






740 






Leu Leu 


Leu Cys 


Ala Phe 


Gly Met 






755 






Met He 


Phe Pro 


Leu He 


Met 


He 






770 






He Leu 


Leu Pro 


Arg He 


He 


Glu 






785 






Asp Ala 


Glu His 


Arg Pro 










800 
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<400> 29 



Met 


Ser 


He 


Phe 


Leu Leu 


Leu 


Arg 


1 








5 






His 


Arg 


Phe 


Gly 


Trp Ala 


Gly 


Leu 










20 






Leu 


Ser 


Thr 


Ala 


Tyr Leu 


Gly 


Leu 










35 






Leu 


Thr 


Ala 


Ala 


Ala Thr 


Phe 


lie 










50 






Leu 


Thr 


Val 


Gly 


Tyr Gly 


Asp 


Leu 










65 






Arg 


He 


Phe 


Val 


Ala Ala 


Trp 


Val 










80 






Leu 


Thr 


Ala 


Ala 


He Gly 


Lys 


Thr 










95 






Trp 


Arg 


Lys 


Gly 


Met Lys 


Gly 


Lys 





535 






540 


Asn 


Pro 


Ser 


Glu Ser 


Pro Phe 




550 






555 


Ser 


Leu 


Arg Ala Val 


Ser Gly 




565 






570 


Ser 


Met 


Leu 


Phe Phe 


He Glu 




580 






585 


Asn 


Ala 


Pro Glu Asn Arg Leu 




595 






600 


Asp 


Leu 


Leu 


Leu Leu 


Ala He 




610 






615 


Gly Leu 


Pro Trp He 


His Ala 




625 






630 


Val 


Arg 


Ala 


Leu Ala 


Leu Val 




640 






645 


He 


Tyr 


Asp 


Thr He 


Val Asn 




655 






660 


Leu Gly 


Ala 


Ser Val 


Leu Val 




670 






675 


Val 


Pro 


Leu 


Gin Trp 


He Pro 




685 






690 


Leu 


Tyr 


He 


Ala Leu 


Thr Ser 




700 






705 


Arg 


Val 


Ala 


Leu Leu 


Leu Lys 




715 






720 


His 


Tyr 


He 


Arg Arg 


Val Pro 




730 






735 


Gly Leu 


Gin Val Leu 


Gin Leu 




745 






750 


Ser 


Ser 


Leu 


Pro Tyr Met Lys 




760 






765 


Ala Met 


He Pro He Arg Tyr 




775 






780 


Ala 


Lys 


Tyr Leu Asp Val Met 




790 






795 



Met Tyr Ala Ser Ser Leu Phe 

10 15 
He Val Ala Leu Gly Val His 

25 30 
Val Leu Leu Gly Glu Gin His 

40 45 
Tyr Phe Tyr Leu Thr Thr Thr 

55 60 
Ala Pro Gin Thr Ser Ala Gly 

70 75 
Met Leu Gly Gly He Ala Leu 

85 90 
Thr Ser Ser Val He Asp Ala 
100 105 
Gly Asp Phe Thr Gly Lys Val 
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Gly His 
Val He 
He Val 
Lys Ala 
Leu Leu 
Thr Pro 
Gin Leu 
Glu He 
Thr Ser 
Pro Gly 
Gly Ala 
Thr Phe 
Thr Leu 
Asn Pro 
He Ala 



110 

Thr Val Leu 
125 

Glu Leu Leu 
140 

He Cys Asp 
155 

Ala Phe He 
170 

Arg Ala Gly 
185 

Ser Asp Asp 
200 

Ser Pro Val 
215 

Ala Ala Leu 
230 

Ser Met Ala 
245 

Ser Ser Val 
260 

Thr Gin Tyr 
275 

Gly Glu Leu 
290 

He Gly Tyr 
305 

Pro Ser Ala 
320 

Ser Thr Arg 
335 



He Gly 
Leu Gin 
Cys Ser 
Arg Gly 
Val Pro 
Leu Thr 
Gly His 
Ala Ser 
He Glu 
Val He 
Leu Met 
Tyr Thr 
Arg Ala 
Thr Glu 
Leu Lys 



Trp Glu 
Asp Glu 
Leu Glu 
Glu Ser 
Gly Ala 
Leu Ala 
Val Val 
Ser Tyr 
Met Leu 
Asn Glu 
Lys Leu 
Gin Met 
Lys Gly 
Val Lys 
Glu He 



115 

Gly Ala 
130 

Thr Ser 
145 

Glu Asn 
160 

Leu Ser 
175 

Glu Arg 
190 

Thr Val 
205 

Ala His 
220 

Ala Pro 
235 

Val Arg 
250 

Leu Leu 
265 

Pro Glu 
280 

Lys Glu 
295 

Val Gin 
310 

Gly Gly 
325 

Ser His 
340 



Ser Ser 
Asn Asp 
Pro Met 
Ser Thr 
Val Leu 
Leu Ala 
Phe Asn 
Arg Leu 
Ala Ser 
Cys Val 
Ala Phe 
Arg His 
Gin Pro 
Glu Leu 
Gly Met 



120 
Glu Arg 

135 
Asn Leu 

150 
Pro Gly 

165 
Ala Leu 

180 
Val Arg 

195 
Val Asn 

210 
Glu Ser 

225 
Glu Cys 

240 
Gin Asp 

255 
Gly Gin 

270 
Glu Ala 

285 
Asn Ala 

300 
Ser He 

315 
Phe Tyr 

330 

Ala 
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<400> 30 



Met 


His 


Leu 


Leu 


He 


He 


Tyr 


lie Gin Phe Gin He Ala Glu Glu 


1 








5 




10 15 


Gly 


Ser 


Thr 


He 


Ser Cys 


Val 


Val Glu Arg Thr Arg Gly Ala Leu 








20 






25 30 


Asp 


Tyr 


Val 


His 


Val 


Phe 


Tyr 


Thr lie Ser Gin lie Glu Thr Asp 






35 






40 45 


Gly 


He 


Asn 


Tyr 


Leu 


Val 


Asp 


Asp Phe Ala Asn Ala Ser Gly Thr 






50 






55 60 


He 


Thr 


Phe 


Leu 


Pro 


Trp 


Gin 


Arg Ser Glu Val Leu Asn lie Tyr 










65 






70 75 


Val 


Leu 


Asp 


Asp 


Asp 


He 


Pro 


Glu Leu Asn Glu Tyr Phe Arg Val 






80 






85 . 90 


Thr 


Leu 


Val 


Ser 


Ala 


He 


Pro 


Gly Asp Gly Lys Leu Gly Ser Thr 










95 






100 105 


Pro 


Thr 


Ser 


Gly 


Ala 


Ser 


He 


Asp Pro Glu Lys Glu Thr Thr Asp 








110 






115 120 


lie 


Thr 


He 


Lys 


Ala 


Ser 


Asp 


His Pro Tyr Gly Leu Leu Gin Phe 








125 






130 135 


Ser 


Thr 


Gly 


Leu 


Pro 


Pro 


Gin 


Pro Lys Asp Ala Met Thr Leu Pro 








140 






145 150 


Ala 


Ser 


Ser 


Val 


Pro 


His 


lie 


Thr Val Glu Glu Glu Asp Gly Glu 
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155 160 165 

He Arg Leu Leu Val He Arg Ala Gin Gly Leu Leu Gly Arg Val 

170 175 180 

Thr Ala Glu Phe Arg Thr Val Ser Leu Thr Ala Phe Ser Pro Glu 

185 190 195 

Asp Tyr Gin Asn Val Ala Gly Thr Leu Glu Phe Gin Pro Gly Glu 

200 205 210 

Arg Tyr Lys Tyr He Phe lie Asn He Thr Asp Asn Ser He Pro 

215 220 225 

Glu Leu Glu Lys Ser Phe Lys Val Glu Leu Leu Asn Leu Glu Gly 

230 235 240 

Gly Ala Ser Leu Gly Val Ala Ser Gin He Leu Val Thr He Ala 

245 250 255 

Ala Ser Asp His Ala His Gly Val Phe Glu Phe Ser Pro Glu Ser 

260 265 270 

Leu Phe Val Ser Gly Thr Glu Pro Glu Asp Gly Tyr Ser Thr Val 

275 280 285 

Thr Leu Asn Val He Arg His His Gly Thr Leu Ser Pro Val Thr 

290 295 300 

Leu His Trp Asn He Asp Ser Asp Pro Asp Gly Asp Leu Ala Phe 

305 310 315 

Thr Ser Gly Asn He Thr Phe Glu He Gly Gin Thr Ser Ala Asn 

320 325 330 

He Thr Val Glu He Leu Pro Asp Glu Asp Pro Glu Leu Asp Lys 

335 340 345 

Ala Phe Ser Val Ser Val Leu Ser Val Ser Ser Gly Ser Leu Gly 

350 355 360 

Ala His He Asn Ala Thr Leu Thr Val Leu Ala Ser Asp Asp Pro 

365 370 375 

Tyr Gly He Phe He Phe Ser Glu Lys Asn Arg Pro Val Lys Val 

380 385 390 

Glu Glu Ala .Thr Gin Asn He Thr Leu Ser He He Arg Leu Lys 

395 400 405 

Gly Leu Met Gly Lys Val Leu Val Ser Tyr Ala Thr Leu Asp Asp 

410 415 420 

Met Glu Lys Pro Pro Tyr Phe Pro Pro Asn Leu Ala Arg Ala Thr 

425 430 435 

Gin Gly Arg Asp Tyr He Pro Ala Ser Gly Phe Ala Leu Phe Gly 

440 445 450 

Ala Asn Gin Ser Glu Ala Thr He Ala He Ser He Leu Asp Asp 

455 460 465 

Asp Glu Pro Glu Arg Ser Glu Ser Val Phe He Glu Leu Leu Asn 

470 475 480 

Ser Thr Leu Val Ala Lys Val Gin Ser Arg Ser He Pro Asn Ser 

485 490 495 

Pro Arg Leu Gly Pro Lys Val Glu Thr He Ala Gin Leu He He 

500 505 510 

He Ala Asn Asp Asp Ala Phe Gly Thr Leu Gin Leu Ser Ala Pro 

515 520 525 

He Val Arg Val Ala Glu Asn His Val Gly Pro He He Asn Val 

530 535 540 

Thr Arg Thr Gly Gly Ala Phe Ala Asp Val Ser Val Lys Phe Lys 

545 550 555 

Ala Val Pro He Thr Ala He Ala Gly Glu Asp Tyr Ser He Ala 

560 565 570 

Ser Ser Asp Val Val Leu Leu Glu Gly Glu Thr Ser Lys Ala Val 

575 580 585 

Pro He Tyr Val He Asn Asp He Tyr Pro Glu Leu Glu Glu Ser 

590 595 600 

Phe Leu Val Gin Leu Met Asn Glu Thr Thr Gly Gly Ala Arg Leu 

605 610 615 

Gly Ala Leu Thr Glu Ala Val He He He Glu Ala Ser Asp Asp 

620 625 630 
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Pro Tyr Gly Leu Phe Gly Phe Gin He Thr Lys Leu He Val Glu 

635 640 645 

Glu Pro Glu Phe Asn Ser Val Lys Val Asn Leu Pro He He Arg 

650 655 660 

Asn Ser Gly Thr Leu Gly Asn Val Thr Val Gin Trp Val Ala Thr 

665 670 675 

He Asn Gly Gin Leu Ala Thr Gly Asp Leu Arg Val Val Ser Gly 

680 685 690 

Asn Val Thr Phe Ala Pro Gly Glu Thr He Gin Thr Leu Leu Leu 

695 700 705 

Glu Val Leu Ala Asp Asp Val Pro Glu He Glu Glu Val He Gin 

710 715 720 

Val Gin Leu Thr Asp Ala Ser Gly Gly Gly Thr He Gly Leu Asp 

725 730 735 

Arg He Ala Asn He He He Pro Ala Asn Asp Asp Pro Tyr Gly 

740 745 750 

Thr Val Ala Phe Ala Gin Met Val Tyr Arg Val Gin Glu Pro Leu 

755 760 765 

Glu Arg Ser Ser Cys Ala Asn He Thr Val Arg Arg Ser Gly Gly 

770 775 780 

His Phe Gly Arg Leu Leu Leu Phe Tyr Ser Thr Ser Asp He Asp 

785 790 795 

Val Val Ala Leu Ala Met Glu Glu Gly Gin Asp Leu Leu Ser Tyr 

800 805 810 

Tyr Glu Ser Pro He Gin Gly Val Pro Asp Pro Leu Trp Arg Thr 

815 820 825 

Trp Met Asn Val Ser Ala Val Gly Glu Pro Leu Tyr Thr Cys Ala 

830 835 840 

Thr Leu Cys Leu Lys Glu Gin Ala Cys Ser Ala Phe Ser Phe Phe 

845 850 855 

Ser Ala Ser Glu Gly Pro Gin Cys Phe Trp Met Thr Ser Trp He 

860 865 870 

Ser Pro Ala Val Asn Asn Ser Asp Phe Trp Thr Tyr Arg Lys Asn 

875 880 885 

Met Thr Arg Val Ala Ser Leu Phe Ser Gly Gin Ala Val Ala Gly 

890 895 900 

Ser Asp Tyr Glu Pro Val Thr Arg Gin Trp Ala He Met Gin Glu 

905 910 915 

Gly Asp Glu Phe Ala Asn Leu Thr Val Ser He Leu Pro Asp Asp 

920 925 930 

Phe Pro Glu Met Asp Glu Ser Phe Leu He Ser Leu Leu Glu Val 

935 940 945 

His Leu Met Asn He Ser Ala Ser Leu- Lys Asn Gin Pro Thr He 

950 955 960 

Gly Gin Pro Asn He Ser Thr Val Val He Ala Leu Asn Gly Asp 

965 970 975 

Ala Phe Gly Val Phe Val He Tyr Asn He Ser Pro Asn Thr Ser 

980 985 990 

Glu Asp Gly Leu Phe Val Glu Val Gin Glu Gin Pro Gin Thr Leu 

995 1000 1005 

Val Glu Leu Met He His Arg Thr Gly Gly Ser Leu Gly Gin Val 
1010 1015 1020 

Ala Val Glu Trp Arg Val Val Gly Gly Thr Ala Thr Glu Gly Leu 
1025 1030 1035 

Asp Phe He Gly Ala Gly Glu He Leu Thr Phe Ala Glu Gly Glu 
1040 1045 1050 

Thr Lys Lys Thr Val He Leu Thr He Leu Asp Asp Ser Glu Pro 
1055 1060 1065 

Glu Asp Asp Glu Ser He He Val Ser Leu Val Tyr Thr Glu Gly 
1070 1075 1080 

Gly Ser Arg He Leu Pro Ser Ser Asp Thr Val Arg Val Asn He 
1085 1090 1095 

Leu Ala Asn Asp Asn Val Ala Gly He Val Ser Phe Gin Thr Ala 
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1100 








1105 






1110 


Ser Arg 


Ser 


Val He 


Gly 


His 


Glu 


Gly Glu He 


Leu 


bin 


Phe His 






1115 








1120 






llZD 


Val lie 


Arg 


Thr Phe 


Pro 


Gly 


Arg 


Gly Asn Val 


i nr 


vai 


Asn Trp 






1130 








1135 






114U 


Lys lie 


lie 


Gly Gin 


Asn 


Gin 


GlU 


Leu Asn Phe 


ft 1 a 
Ala 


Asn 


Phe Ser 






1145 








1150 






1133 


Gly Gin 


Leu 


Phe Phe 


Pro 


Glu 


Gly 


Ser Leu Asn 


Tnr 


inr 


Leu Phe 




1160 








1165 






11 /u 


Val His 


Leu 


Leu Asp 


Asp 


Asn 


lie 


Pro Glu Glu 


Lys 


Glu 


Val Tyr 






1175 








1180 






1185 


Gin Val 


He 


Leu Tyr 


Asp 


Val 


Arg 


Thr Gin Gly 


Val 


Pro 


Pro Ala 






1190 








1195 






1200 


Gly He 


Ala 


Leu Leu 


Asp 


Ala 


Gin 


Gly Tyr Ala 


Ala 


Val 


Leu Thr 






1205 








1210 






1215 


Val Glu 


Ala 


Ser Asp 


Glu 


Pro 


His 


Gly Val Leu 


Asn 


Phe 


Ala Leu 






1220 








1225 






1230 


Ser Ser 


Arg 


Phe Val 


Leu 


Leu 


Gin 


Glu Ala Asn 


He 


Thr 


He Gin 




1235 








1240 






1245 


Leu Phe 


He 


Asn Arg 


Glu 


Phe 


Gly 


Ser Leu Gly 


Ala 


He 


Asn Val 






1250 








1255 






1260 


Thr Tyr 


Thr 


Thr Val 


Pro 


Gly 


Met 


Leu Ser Leu 


Lys 


Asn 


Gin Thr 




1265 








1270 






1275 


Val Gly 


Asn 


Leu Ala 


Glu 


Pro 


Glu 


Val Asp Phe 


Val 


Pro 


He He 




1280 








1285 






1290 


Gly Phe 


Leu 


He Leu 


Glu 


Glu 


Gly 


Glu Thr Ala 


Ala 


Ala 


He Asn 




1295 








1300 






1305 


lie Thr 


He 


Leu Glu 


Asp 


Asp 


Val 


Pro Glu Leu 


Glu 


Glu 


Tyr Phe 






1310 






1315 






1320 


Leu Val 


Asn 


Leu Thr 


Tyr 


Val 


Gly 


Leu Thr Met 


Ala 


Ala 


Ser Thr 






1325 






1330 






1335 


Ser Phe 


Pro 


Pro Arg 


Leu 


Asp 


Ser 


Glu Gly Leu 


Thr 


Ala 


Gin Val 






1340 








1345 






1350 


He He 


Asp 


Ala Asn 


Asp 


Gly 


Ala 


Arg Gly Val 


He 


Glu 


Trp Gin 




1355 








1360 






1365 


Gin Ser 


Arg 


Phe Glu 


Val 


Asn 


Glu 


Thr His Gly 


Ser 


Leu 


Thr Leu 




1370 








1375 






1380 


Val Ala 


Gin 


Arg Ser 


Arg 


Glu 


Pro 


Leu Gly His 


Val 


Ser 


Leu Phe 






1385 








1390 






1395 


Val Tyr 


Ala 


Gin Asn 


Leu 


Glu 


Ala 


Gin Val Gly 


Leu 


Asp 


Tyr He 




1400 








1405 






1410 


Phe Thr 


Pro 


Met He 


Leu 


His 


Phe 


Ala Asp Gly 


Glu 


Arg Tyr Lys 






1415 








1420 






1425 


Asn Val 


Asn 


He Met 


He 


Leu 


Asp 


Asp Asp He 


Pro Glu Gly Asp 






1430 








1435 






1440 


Glu Lys 


Phe 


Gin Leu 


He 


Leu 


Thr 


Asn Pro Ser 


Pro Gly Leu Glu 




1445 








1450 






1455 


Leu Gly 


Lys 


Asn Thr 


He 


Ala 


Leu 


He He Val 


Leu 


Ala 


Asn Asp 


1460 








1465 






1470 


Asp Gly 


Pro 


Gly Val 


Leu 


Ser 


Phe 


Asn Asn Ser 


Glu 


His 


Phe Phe 




1475 








1480 






1485 


Leu Arg 


Glu 


Pro Thr 


Ala 


Leu 


Tyr 


Val Gin Glu 


Ser Val 


Ala Val 




1490 








1495 






1500 


Leu Tyr 


He 


Val Arg 


Glu 


Pro 


Ala 


Gin Gly Leu 


Phe Gly Thr Val 




1505 








1510 






1515 


Thr Val 


Gin 


Phe He 


Val 


Thr 


Glu 


Val Asn Ser 


Ser 


Asn 


Glu Ser 






1520 








1525 






1530 


Lys Asp 


Leu 


Thr Pro 


Ser 


Lys 


Gly 


Tyr He Val 


Leu Glu Glu Gly 






1535 








1540 






1545 


Val Arg 


Phe 


Lys Ala 


Leu 


Gin 


He 


Ser Ala He 


Leu 


Asp 


Thr Glu 




1550 








1555 






1560 


Pro Glu 


Met 


Asp Glu 


Tyr 


Phe 


Val 


Cys Thr Leu 


Phe 


Asn 


Pro Thr 






1565 








1570 






1575 
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Gly 


Gly 


Ala 


Arg Leu 








1580 


Leu 


Gin 


Asn 


Gin Ala 








1595 


Glu 


Asn 


Arg 


Ala Thr 








1610 


Val 


Tyr 


Leu 


Asn Val 








1625 


Leu 


Asn 


He 


Glu Asn 








1640 


Phe 


Ser 


Pro 


Tyr Phe 








1655 


Lys 


Pro 


Ser 


Leu Asn 








1670 


Leu 


Phe 


Leu 


Val Gin 








1685 


Arg 


Tyr 


Phe 


Thr Ser 








1700 


Gin 


Arg 


Asp 


Asp Ser 








1715 


Gly 


Ser 


Phe 


Val Leu 








1730 


Thr 


Val 


Ala 


Leu Phe 








1745 


Ser 


Gin 


Ala 


Asn Ala 








1760 


Gly 


Ser 


Gly 


Phe He 








1775 


Thr 


Glu 


Val 


Glu Ala 








1790 


Phe 


Ala 


Lys 


Asn Val 








1805 


Phe 


lie 


Trp 


Glu Met 








1820 


Val 


Asp 


Phe 


Ala Ala 








1835 


Ser 


Gly 


He 


Ala His 








1850 


Leu 


Tyr 


Cys 


Trp Asn 








1865 


Glu 


Val 


Pro 


Ser Ala 








1880 


Leu 


Asn 


Ser 


Ser Lys 








1895 


His 


He 


Tyr 


Glu Leu 








1910 


Pro 


Ser 


Ser 


Gly Glu 








1925 


Thr 


He 


Ala 


Val Asn 








1940 


Glu 


Ser 


Phe 


Lys Val 








1955 


lie 


Gly 


He 


Asn Asp 








1970 


Asp 


Ala 


Tyr 


Gly He 








1985 


Gin 


Val 


Glu 


Glu Met 








2000 


Glu 


Arg 


Leu 


Lys Gly 








2015 


Ala 


Asp 


Gly 


Ser lie 








2030 


Glu 


Lys 


Arg 


Met Ser 



Gly 


Val 


His 


Val Gin 






1585 


Pro 


Leu 


Gly 


Leu Phe 








1600 


Ser 


He 


Asp 


He Glu 








1615 


Ser 


Arg 


Thr 


Asn Gly 








1630 


Pro 


Lys 


Thr 


Cys Glu 








1645 


Val 


He 


Thr 


His Glu 








1660 


Ser 


Val 


Phe 


Thr Phe 








1675 


Thr 


He 


He 


He Leu 








1690 


Asp 


Ser 


Gin 


Asp Tyr 








1705 


Glu 


Leu 


Thr 


Gin Val 








1720 


His 


Gin 


Lys 


Leu Pro 








1735 


Asn 


Lys 


Gly 


Gly Ser 








1750 


Arg 


Leu 


Asn 


Ser Leu 








1765 


Asn 


Phe 


Gin 


Glu Val 








1780 


Leu 


Ser 


Ser 


Ala Asn 








1795 


Phe 


Leu 


Gly 


Asp Gin 








1810 


Gly 


Gin 


Ser 


Ser Phe 








1825 


Val 


Asn 


Arg 


He His 








1840 


He 


Leu 


Leu 


He Gly 








1855 


Ser 


Glu 


Arg 


Asn Gin 








1870 


Tyr 


Asp 


Val 


Ala Ser 








1885 


Asn 


Leu 


He 


Ala Leu 








1900 


Ala 


Tyr 


He 


Ser Ser 








1915 


Leu 


He 


Phe 


Glu Pro 








1930 


He 


Leu 


Asp 


Asp Thr 








1945 


Gin 


Leu 


Lys 


Asn Pro 








1960 


Ser 


Val 


Thr 


He Thr 








1975 


Val 


Ala 


Phe 


Ala Gin 








1990 


Glu 


Gin 


Asp 


Ser Leu 








2005 


Thr 


Tyr 


Gly 


Arg He 








2020 


Ser 


Asp 


He 


Phe Pro 








2035 


Ala 


Lys 


He 


Leu Phe 



Thr 


Leu 


lie 


Thr Val 








1590 


Ser 


He 


Ser 


Ala Val 








1605 


Glu 


Ala 


Asn 


Arg Thr 








1620 


He 


Asp 


Leu 


Ala Asp 








1635 


Ala 


Phe 


Asn 


He Gly 








1650 


Glu 


Arg 


Asn 


Glu Glu 








1665 


Thr 


Ser 


Gly 


Phe Lys 








1680 


Glu 


Ser 


Ser 


Gin Val 








1695 


Leu 


He 


He 


Ala Ser 








1710 


Phe 


Arg 


Trp 


Asn Gly 








1725 


Val 


Arg 


Gly 


Val Leu 








1740 


Val 


Phe 


Leu 


Ala He 








1755 


Leu 


Phe 


Arg 


Trp Ser 








1770 


Pro 


Val 


Ser 


Gly Thr 








1785 


Asp 


He 


Tyr 


Leu He 








1800 


Asn 


Ser 


He 


Asp He 








1815 


Arg 


Tyr 


Phe 


Gin Ser 








1830 


Ser 


Phe 


Thr 


Pro Ala 








1845 


Gin 


Asp 


Met 


Ser Ala 








1860 


Phe 


Ser 


Phe 


Val Leu 








1875 


Val 


Thr 


Val 


Lys Ser 








1890 


Val 


Gly 


Ala 


His Ser 








1905 


His 


Ser 


Asp 


Phe He 








1920 


Gly 


Glu 


Arg 


Glu Ala 








1935 


Val 


Pro 


Glu 


Lys Glu 








1950 


Lys 


Gly 


Gly 


Ala Glu 








1965 


He 


Leu 


Ser 


Asn Asp 








1980 


Asn 


Ser 


Leu 


Tyr Lys 








1995 


Val 


Thr 


Leu 


Asn Val 








2010 


Thr 


He 


Ala 


Trp Glu 








2025 


Thr 


Ser 


Gly 


Val Val 








2040 


Thr 


Glu 


Gly 


Gin Val 
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2045 2050 2055 

Leu Ser Thr lie Thr Leu Thr lie Leu Ala Asp Asn lie Pro Glu 

2060 2065 2070 

Leu Ser Glu Val Val lie Val Thr Leu Thr Arg He Thr Thr Glu 

2075 2080 2085 

Gly Val Glu Asp Ser Tyr Lys Gly Ala Thr He Asp Gin Asp Arg 

2090 2095 2100 

Ser Lys Ser Val He Thr Thr Leu Pro Asn Asp Ser Pro Phe Gly 

2105 2110 2115 

Leu Val Gly Trp Arg Ala Ala Ser Val Phe He Arg Val Ala Glu 

2120 2125 2130 

Pro Lys Glu Asn Thr Thr Thr Leu Gin Leu Gin He Ala Arg Asp 

2135 2140 2145 

Lys Gly Leu Leu Gly Asp He Ala He His Leu Arg Ala Gin Pro 

2150 2155 2160 

Asn Phe Leu Leu His Val Asp Asn Gin Ala Thr Glu Asn Glu Asp 

2165 2170 2175 

Tyr Val Leu Gin Glu Thr He He He Met Lys Glu Asn He Lys 

2180 2185 2190 

Glu Ala His Ala Glu Val Ser He Leu Pro Asp Asp Leu Pro Glu 

2195 2200 2205 

Leu Glu Glu Gly Phe He Val Thr He Thr Glu Val Asn Leu Val 

2210 2215 2220 

Asn Ser Asp Phe Ser Thr Gly Gin Pro Ser Val Arg Arg Pro Gly 

2225 2230 2235 

Met Glu He Ala Glu He Met He Glu Glu Asn Asp Asp Pro Arg 

2240 2245 2250 

Gly He Phe Met Phe His Val Thr Arg Gly Ala Gly Glu Val He 

2255 2260 2265 

Thr Ala Tyr Glu Val Pro Pro Pro Leu Asn Val Leu Gin Val Pro 

2270 2275 2280 

Val Val Arg Leu Ala Gly Ser Phe Gly Ala Val Asn Val Tyr Trp 

2285 2290 2295 

Lys Ala Ser Pro Asp Ser Ala Gly Leu Glu Asp Phe Lys Pro Ser 

2300 2305 2310 

His Gly He Leu Glu Phe Ala Asp Lys Gin Val Thr Ala Met He 

2315 2320 2325 

Glu He Thr He He Asp Asp Ala Glu Phe Glu Leu Thr Glu Thr 

2330 2335 2340 

Phe Asn He Ser Leu He Ser Val Ala Gly . Gly Gly Arg Leu Gly 

2345 2350 2355 

Asp Asp Val Val Val Thr Val Val He Pro Gin Asn Asp Ser Pro 

2360 2365 2370 

Phe Gly Val Phe Gly Phe Glu Glu Lys Thr Val Met He Asp Glu 

2375 2380 2385 

Ser Leu Ser Ser Asp Asp Pro Asp Ser Tyr Val Thr Leu Thr Val 

2390 2395 2400 

Val Arg Ser Pro Gly Gly Lys Gly Thr Val Arg Leu Glu Trp Thr 

2405 2410 2415 

He Asp Glu Lys Ala Lys His Asn Leu Ser Pro Leu Asn Gly Thr 

2420 2425 2430 

Leu His Phe Asp Glu Thr Glu Ser Gin Lys Thr He Val Leu His 

2435 2440 2445 

Thr Leu Gin Asp Thr Val Leu Glu Glu Asp Arg Arg Phe Thr He 

2450 2455 2460 

Gin Leu He Ser He Asp Glu Val Glu He Ser Pro Val Lys Gly 

2465 2470 2475 

Ser Ala Ser He He He Arg Gly Asp Lys Arg Ala Ser Gly Glu 

2480 2485 2490 

Val Gly He Ala Pro Ser Ser Arg His He Leu He Gly Glu Pro 

2495 2500 2505 

Ser Ala Lys Tyr Asn Gly Thr Ala He He Ser Leu Val Arg Gly 

2510 2515 2520 
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Pro Gly lie Leu Gly Glu Val Thr Val Phe Trp Arg lie Phe Pro 

2525 2530 2535 

Pro Ser Val Gly Glu Phe Ala Glu Thr Ser Gly Lys Leu Thr Met 

2540 2545 2550 

Arg Asp Glu Gin Ser Ala Val He Val Val He Gin Ala Leu Asn 

2555 2560 2565 

Asp Asp He Pro Glu Glu Lys Ser Phe Tyr Glu Phe Gin Leu Thr 

2570 2575 2580 

Ala Val Ser Glu Gly Gly Val Leu Ser Glu Ser Ser Ser Thr Ala 

2585 2590 2595 

Asn He Thr Val Val Ala Ser Asp Ser Pro Tyr Gly Arg Phe Ala 

2600 2605 2610 

Phe Ser His Glu Gin Leu Arg Val Ser Glu Ala Gin Arg Val Asn 

2615 2620 2625 

He Thr He He Arg Ser Ser Gly Asp Phe Gly His Val Arg Leu 

2630 2635 2640 

Trp Tyr Lys Thr Met Ser Gly Thr Ala Glu Ala Gly Leu Asp Phe 

2645 2650 2655 

Val Pro Ala Ala Gly Glu Leu Leu Phe Glu Ala Gly Glu Met Arg 

2660 2665 2670 

Lys Ser Leu His Val Glu He Leu Asp Asp Asp Tyr Pro Glu Gly 

2675 2680 2685 

Pro Glu Glu Phe Ser Leu Thr He Thr Lys Val Glu Leu Gin Gly 

2690 2695 2700 

Arg 



<210> 31 

<211> 610 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 71726948CD1 

<400> 31 

Met Asp Thr Pro Arg Gly He Gly Thr Phe Val Val Trp Asp Tyr 

15 10 15 

Val Val Phe Ala Gly Met Leu Val He Ser Ala Ala He Gly He 

20 25 30 

Tyr Tyr Ala Phe Ala Gly Gly Gly Gin Gin Thr Ser Lys Asp Phe 

35 40 45 

Leu Met Gly Gly Arg Arg Met Thr Ala Val Pro Val Ala Leu Ser 

50 55 60 

Leu Thr Ala Ser Phe Met Ser Ala Val Thr Val Leu Gly Thr Pro 

65 70 75 

Ser Glu Val Tyr Arg Phe Gly Ala He Phe Ser He Phe Ala Phe 

80 85 90 

Thr Tyr Phe Phe Val Val Val He Ser Ala Glu Val Phe Leu Pro 

95 100 105 

Val Phe Tyr Lys Leu Gly He Thr Ser Thr Tyr Glu Tyr Leu Glu 

110 115 120 

Leu Arg Phe Asn Lys Cys Val Arg Leu Cys Gly Thr Val Leu Phe 

125 130 135 

He Val Gin Thr He Leu Tyr Thr Gly He Val He Tyr Ala Pro 

140 145 150 
Ala Leu Ala Leu Asn Gin Val Thr Gly Phe Asp Leu Trp Gly Ala 

155 160 165 
Val Val Ala Thr Gly Val Val Cys Thr Phe Tyr Cys Thr Leu Gly 

170 175 180 
Gly Leu Lys Ala Val He Trp Thr Asp Val Phe Gin Val Gly He 

185 190 195 
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Met Val Ala Gly 


Phe 


Ala 


Ser Val 


He 


He 


Gin Aia va± vai wet 




200 








205 




Gin Gly Gly lie 


Ser 


Thr 


He Leu 


Asn 


Asp 


Aia iyt Asp Kjiy \sjiy 




215 








220 


one 


Arg Leu Asn Phe 


Trp 


Asn 


Phe Asn 


Pro 


Asn 


fro tieu v»j.n Arg nis 




230 








235 




Thr Phe Trp Thr 


lie 


lie 


He Gly Gly Thr 


rfie in± l xp inr o@r 




245 








250 


ZDD 


lie Tyr Gly Val 


Asn 


Gin 


Ser Gin Val 


Gin 


Arg Tyr He Ser Cys 


260 








265 


o -i n 

270 


Lys Ser Arg Phe 


Gin 


Ala 


Lys Leu 


Ser 


Leu 


Tyr He Asn Leu Val 


275 








280 


o o c 
285 


Gly Leu Trp Ala 


He 


Leu Thr Cys 


Ser Val 


Phe Cys Gly Leu Ala 


290 








295 


300 


Leu Tyr Ser Arg 


Tyr 


His Asp Cys Asp 


Pro 


Trp Thr Ala Lys Lys 


305 








310 


315 


Val Ser Ala Pro 


Asp 


Gin 


Leu Met 


Pro 


Tyr 


Leu Val Leu Asp He 




320 








325 


330 


Leu Gin Asp Tyr 


Pro 


Gly Leu Pro Gly Leu 


Phe Val Ala Cys Ala 


335 








340 


345 


Tyr Ser Gly Thr 


Leu 


Ser 


Thr Val 


Ser 


Ser 


Ser He Asn Ala Leu 


350 








355 


360 


Ala Ala Val Thr 


Val 


Glu 


Asp Leu 


He 


Lys 


Pro Tyr Phe Arg Ser 




365 








370 


375 


Leu Ser Glu Arg 


Ser 


Leu 


Ser Trp 


He 


Ser 


Gin Gly Met Ser Val 


380 








385 


390 


Val Tyr Gly Ala 


Leu 


Cys 


He Gly Met Ala 


Ala Leu Ala Ser Leu 


395 








400 


405 


Met Gly Ala Leu 


Leu 


Gin Ala Ala 


Leu 


Ser 


Val Phe Gly Met Val 


410 








415 


420 


Gly Gly Pro Leu 


Met 


Gly Leu Phe Ala Leu 


Gly He Leu Val Pro 


425 








430 


43d 


Phe Ala Asn Ser 


He 


Gly Ala Leu Val Gly 


Leu Met Ala Gly Phe 




440 








445 


i r a 

450 


Ala lie Ser Leu 


Trp 


Val 


Gly He Gly Ala 


Gin He Tyr Pro Pro 




455 








460 


465 


Leu Pro Glu Arg 


Thr 


Leu 


Pro Leu 


His 


Leu 


Asp He Gin Gly Cys 


470 








475 


480 


Asn Ser Thr Tyr 


Asn 


Glu 


Thr Asn 


Leu 


Met 


Thr Thr Thr Glu Met 


485 








490 


495 


Pro Phe Thr Thr 


Ser 


Val 


Phe Gin 


He 


Tyr 


Asn Val Gin Arg Thr 




500 








505 


510 


Pro Leu Met Asp 


Asn 


Trp 


Tyr Ser 


Leu 


Ser 


Tyr Leu Tyr Phe Ser 




515 








520 


525 


Thr Val Gly Thr 


Leu 


Val 


Thr Leu 


Leu 


Val 


Gly He Leu Val Ser 


530 








535 


e J ft 

540 


Leu Ser Thr Gly 


Gly 


Arg 


Lys Gin 


Asn 


Leu 


Asp Pro Arg Tyr He 


545 








550 




Leu Thr Lys Glu 


Asp 


Phe 


Leu Ser 


Asn 


Phe 


Asp He Phe Lys Lys 


560 








565 


570 


Lys Lys His Val 


Leu 


Ser Tyr Lys 


Ser 


His 


Pro Val Glu Asp Gly 


575 








580 


585 


Gly Thr Asp Asn 


Pro 


Ala 


Phe Asn 


His 


He 


Glu Leu Asn Ser Asp 


590 








595 


600 


Gin Ser Gly Lys 


Ser 


Asn Gly Thr Arg Leu 






605 








610 





<210> 32 
<211> 552 
<212> PRT 

<213> Homo sapiens 
<220> 
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<221> misc_feature 

<223> Incyte ID No: 7487393CD1 



<400> 32 

Met Gly Phe Asp Val Leu Leu Asp Gin Val Gly Gly Met Gly Arg 
15 10 15 

Phe Gin He Cys Leu He Ala Phe Phe Cys He Thr Asn He Leu 

20 25 30 

Leu Phe Pro Asn He Val Leu Glu Asn Phe Thr Ala Phe Thr Pro 

35 40 45 

Ser His Arg Cys Trp Val Pro Leu Leu Asp Asn Asp Thr Val Ser 
50 55 60 

Asp Asn Asp Thr Gly Thr Leu Ser Lys Asp Asp Leu Leu Arg He 
65 70 75 

Ser He Pro Leu Asp Ser Asn Leu Arg Pro Gin Lys Cys Gin Arg 
80 85 90 

Phe He His Pro Gin Trp Gin Leu Leu His Leu Asn Gly Thr Phe 
95 100 105 

Pro Asn Thr Asn Glu Pro Asp Thr Glu Pro Cys Val Asp Gly Trp 

110 115 120 

Val Tyr Asp Arg Ser Ser Phe Leu Ser Thr He Val Thr Glu Trp 

125 130 135 

Asp Leu Val Cys Glu Ser Gin Ser Leu Lys Ser Met Val Gin Ser 

140 145 150 

Leu Phe Met Ala Gly Ser Leu Leu Gly Gly Leu He Tyr Gly His 

155 160 165 

Leu Ser Asp Arg Val Gly Arg Lys He He Cys Lys Leu Cys Phe 

170 175 180 

Leu Gin Leu Ala He Ser Asn Thr Cys Ala Ala Phe Ala Pro Thr 

185 190 195 

Phe Leu Val Tyr Cys He Leu Arg Phe Leu Ala Gly Phe Ser Thr 

200 205 210 

Met Thr He Leu Gly Asn Thr Phe He Leu Ser Leu Glu Trp Thr 

215 220 225 

Leu Pro Arg Ser Arg Ser Met Thr He Met Val Leu Leu Cys Ser 

230 235 240 

Tyr Ser Val Gly Gin Met Leu Leu Gly Gly Leu Ala Phe Ala He 

245 250 255 

Gin Asp Trp His He Leu Gin Leu Thr Val Ser Thr Pro He He 

260 265 270 

Val Leu Phe Leu Ser Ser Trp Lys Met Val Glu Ser Ala Arg Trp 

275 280 285 

Leu He He Asn Asn Gin Leu Asp Glu Gly Leu Lys Glu Leu Arg 

290 295 300 

Arg Val Ala His He Asn Gly Lys Lys Asn Thr Glu Glu Thr Leu 

305 310 315 

Thr Thr Glu Leu Val Arg Ser Thr Met Lys Lys Glu Leu Asp Ala 

320 325 330 

Val Arg He Lys Thr Ser He Phe Ser Leu Phe Arg Ala Pro Lys 

335 340 345 

Leu Arg Met Arg Val Phe Gly Leu Cys Phe Val Arg Phe Ala He 

350 355 360 

Thr Val Pro Phe Tyr Gly Leu He Leu Asn Leu Gin His Leu Gly 

365 370 375 

Ser Asn Val Ser Leu Phe Gin He Leu Cys Gly Ala Val Thr Phe 

380 385 390 

Thr Ala Arg Cys Val Ser Leu Leu Thr Leu Asn His Met Gly Arg 

395 400 405 

Arg He Ser Gin He Leu Phe Thr Phe Pro Val Gly Leu Phe He 

410 415 420 

Leu Val Asn Thr Phe Leu Pro Gin Glu Met Gin He Leu Arg Val 

425 430 435 

Val Leu Ala Thr Leu Gly He Gly Ser Val Ser Ala Ala Ser Asn 
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440 






445 






450 


Ser 


Ala 


Ser Val 


His 


His 


Asn 


Glu Leu Val Pro Thr 


lie 


Leu 


Arg 








455 






460 • 






A C C 


Ser 


Thr 


Val Ala 


Gly 


He 


Asn 


Ala Val Ser Gly Arg 


Tnr 


c»iy 


Ala 








470 






475 






A O C\ 


Ala 


Leu 


Ala Pro 


Leu 


Leu 


Met 


Tnr Leu Met Aia ryr 


Ser 


Pro 


HIS 








485 






490 








Leu 


Pro 


Trp He 


Ser 


Tyr 


Gly 


Val Phe Pro He Leu 


Ala 


Val 


Pro 








500 




505 






510 


Val 


lie 


Leu Leu 


Leu 


Pro 


Glu 


Thr Arg Asp Leu Pro 


Leu 


Pro 


Asn 








515 






520 






525 


Thr 


He 


Gin Asp 


Val 


Glu 


Asn 


Asp Arg Lys Asp Ser 


Arg 


Asn 


lie 








530 






535 






540 


Lys Gin 


Glu Asp 


Thr 


Cys 


Met 


Lys Val Thr Gin Phe 














545 






550 









<210> 33 
<211> 2365 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7484831CB1 

<400> 33 

atggagcctg gtgacgcggc gcgccctggc tcgggtcggg ctaccggggc gccgccgccg 60 
cggctgctgc tgctgccgct gctgctgggt tgggggctgc gagtcgcggc cgcggcctcg 120 
gcctcctcct ctggggcggc ggccgaggac agcagcgcca tggaggagct cgctactgag 180 
aaggaggcgg aggagagcca ccggcaagac agcgtgagcc tgctcacctt catcctgctg 240 
ctcacgctca ccatcctcac catctggctc ttcaagcacc gccgggtgcg ctttctgcac 300 
gagaccgggc tggccatgat ctatgggctc atcgttgggg tgatcctgag gtatggtacc 360 
cctgctacca gtggccgtga caaatcactc agctgcactc aggaagacag ggccttcagt 420 
accttattag tgaatgtcag cggaaagttc ttcgaataca ctctgaaagg agaaatcagt 480 
cctggcaaga tcaacagcgt agagcagaat gatatgctac ggaaggtaac attcgatcca 540 
gaagtatttt tcaacattct tctgcctcca attatttttc atgctggata cagcttaaag 600 
aagagacact ttttcagaaa tcttggatct atactggcct atgccttctt ggggactgct 660 
gtttcatgct tcattattgg aaatctcatg tatggtgtgg tgaagctcat gaagattatg 720 
ggacagctct cagataaatt ttactacaca gattgtctct tttttggagc aatcatctct 780 
gccactgacc cagtgactgt gctggcgata tttaatgaat tgcatgcaga cgtggatctt 840 
tacgcacttc tttttggaga gagcgtccta aatgatgctg ttgccattgt actgtcctcg 900 
tctattgttg cctaccagcc agcgggactg aacactcacg cctttgatgc tgctgccttt 960 
tttaagtcag ttggcatttt tctaggtata tttagtggct cttttaccat gggagctgtg 1020 
actggtgttg tgactgctct agtgactaag tttaccaaac tgcactgctt ccccctgctg 1080 
gagacggcgc tgttcttcct catgtcctgg agcacgtttc tcttggcaga agcctgcgga 1140 
tttacaggtg ttgtagctgt ccttttctgt ggaatcacac aagctcatta cacctacaac 1200 
aatctgtcgg tggaatcaag aagtcgaacc aagcagctct ttgaggtgtt acatttcctg 1260 
gcagagaact tcatcttctc ctacatgggc ctggcactgt ttaccttcca gaagcacgtt 1320 
ttcagcccca ttttcatcat cggagctttt gttgccatct tcctgggcag agccgcgcac 1380 
atctacccgc tctccttctt cctcaacttg ggcagaaggc ataagattgg ctggaatttt 1440 
caacacatga tgatgttttc aggcctcagg ggagcaatgg catttgcgtt ggccatccgt 1500 
gacacggcat cctatgctcg ccagatgatg ttcacgacca cccttctcat tgtgttcttc 1560 
actgtctgga tcattggagg aggcacgaca cccatgttgt catggcttaa catcagagtt 1620 
ggcgtcgagg agccctccga agaggaccag aatgaacacc actggcagta cttcagagtt 1680 
ggtgttgacc ccgatcaaga cccaccaccc aacaacgaca gctttcaagt cttacaaggg 1740 
gacggcccag attctgccag aggaaaccgg acaaaacagg agagcacatg gatattcagg 1800 
ctgtggtaca gctttgatca caattatctg aagcccatcc tcacacacag tggtccccca 1860 
ctaaccacca cgctccccgc ctggtgtggc ttactagctc gatgtctgac cagtccccag 1920 
gtgtacgata accaagagcc actgagagag gaagactctg atttcatcct gaccgaaggc 1980 
gacctgacat tgacctacgg ggacagcaca gtgactgcaa atggctcctc aagttcgcac 2040 
accgcctcca cgagtctgga gggcagccgg agaacgaaga gcagctcgga ggaagtgctg 2100 
gagcgagacc tgggaatggg agaccagaag gtttcgagcc ggggcacccg cctagtgttt 2160 
cccctggaag ataatgcttg actttccccc caagccctgg cgcgatggga ctgaagattg 2220 
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aaaatacatc cacgaacatt tcaacatgga acgaagaatt atagttcctt ctctggctat 2280 
atccataaag aagtagttat gaaatgttta aaaccaaagg caaatagtgt tatactctta 2340 
tttttgatta atctgaggaa aggag 2365 



<210> 34 
<211> 3400 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2477266CB1 

<400> 34 

agcggcggct cagttgctgc catggagtca 
ctgctaccgg acaccgagcg catccgtcgg 
gcccccgccg ctttgccggc tctctgcgac 
cgccagtttg cggccgtgct gacccgcaga 
gcggagcaac gggagagcct caagtccctg 
cactgtgtga gcctcagcct ggcccagctc 
gaggcctggc cacagctttt gcagctgctt 
gagagagaga tggggctttt gctgctaagt 
caaccccacc accgggagct tcttcggctt 
cctgggctgc tcttctactc cctgcgcact 
gaagatgtgc ctctcgctcg gatgttggtg 
atccccatag atgaggcaaa ggcctgtgag 
tcagaggtgc cggtcatcac cccctacctc 
gctagaaatg tggccctggg caatgcgata 
ttggtcaaag tcaagagcaa ggccttactg 
acccttttcc ccattgtggc tgctgagccc 
gattcagaag aggaagagtt ggagattgag 
gtacaagttg tggacatgct ggcactacac 
atgcccatgt tggaagaggc tttgcggagc 
ctggtgctgg ccgtgctgtc tgacggagct 
ccactgctgc agattgtgtg caagggcctg 
gcgctgtttg ccctgggcca gttctcagaa 
agggaggtaa tgccactgct cctcgcctac 
cacctagcca aggcctgcta tgccctggag 
cagccctacc ttccggagct tatggaatgc 
ccccgggcca aggagctggc tgtgagcgcc 
tcgctgctgc cctacttccc tgccatcatg 
cgtgaggacc ttcagcctgt gcagatccag 
gcagtggggg agcccatgag gccgctggct 
tgcgaccagg tagacgacgc tgacttgcgg 
tcgggtctga tgggtgaggg cctggcgccc 
ctgtcactgc gttccaccga gggcattgtg 
ctgtttgacg atgagagtga tggggaagaa 
gaagaggatg actcagagat ctcagggtac 
gaagacacct gtgctgccgt gggggagatc 
tacatggaaa gtgtctttga agaagtattt 
cggaaggcag cccatgaggc tctgggtcag 
agctgcccct cggaacccaa cactgctgct 
tcctacatgc aggcagtgaa cagggagcgg 
gccctgacag gggtgctccg cagctgtggg 
gctgagctct gtggcgtgct caaggctgtg 
gacgaggagg aggaagagga agatgatgat 
cacgctggag aggccatccc tgccctggca 
ttctttgccg gtttcctgcc attattggtg 
gagaagtcct ttgcagtggg gaccttggca 
gcccagtttg tgtctcggct gctccctgtg 
gaggtgcgaa gcaatgccat cttcgggatg 
gcccaggaac acttccccaa gctgctgggg 
catgatcgtg tccgtgacaa catctgtggg 



gccgggctag agcagctcct acgggagctg 60 
gccacggaac agctccagat cgttcttcgg 120 
ctgctagcct cggcggccga cccccagatc 180 
cgactgaaca cccgctggcg acggctggcg 240 
atcctgacgg ccctgcagag agaaacagag 300 
tcagccacca tttttcgaaa ggaaggcttg 360 
cagcacagta cccacagccc ccacagccca 420 
gtggtggtga cctcccggcc cgaggccttc 480 
ctgaatgaga ctcttggtga ggtgggctct 540 
ctgaccacca tggctcccta cctcagcact 600 
cccaagctga tcatggccat gcagactctg 660 
gcccttgagg ctttggatga actgttggag 720 
tctgaagtcc tcacattctg cctggaggta 780 
cgcatacgta ttctctgctg cctcactttc 840 
aagaatcgtc tcctgccacc cttgctgcac 900 
ccaccaggcc agttggatcc cgaggaccag 960 
ctgatggggg agactcccaa gcatttcgct 1020 
ctgccccccg agaagctctg tccccagctg 1080 
gagagcccat accagcgcaa agctggactc 1140 
ggcgaccaca tcaggcagag actgctgccc 1200 
gaggacccct cgcaagttgt acgcaatgct 1260 
aacctacagc cccatatcag cagctattca 1320 
ttgaagtcgg tgcctcttgg acacacacac 1380 
aattttgtgg agaacctagg gcccaaggtg 1440 
atgctgcagc ttctgaggaa ccccagcagt 1500 
ctgggagcca ttgctacggc tgcccaggcc 1560 
gagcacctgc gggaattcct gttaacaggc 1620 
agcctggaga cactgggggt gctggcacga 1680 
gaggaatgct gccagctggg tctgggcctc 1740 
cgctgcacgt acagcctatt tgcagcctta 1800 
cacttggaac agatcaccac gctcatgctg 1860 
cctcagtatg acgggagcag ctccttcctt 1920 
gaggaggagc tcatggatga ggatgtggaa 1980 
agcgtggaga atgccttctt cgatgagaag 2040 
tctgtgaaca ccagtgtggc cttccttcca 2100 
aaactgctgg agtgccctca cctgaatgtg 2160 
ttttgctgtg cactgcacaa ggcctgtcaa 2220 
ttgcaggctg ccctggcccg agtcgtgcca 2280 
gaacgccagg tggtgatggc cgtgctggag 2340 
accctcacac tgaagccccc tgggcgcctc 2400 
ctgcagagga agacagcctg tcaggatact 2460 
caggctgaat acgacgccat gttgctggag 2520 
gccgcggctg ggggagactc ctttgcccca 2580 
tgcaagacaa aacagggctg cacagtggca 2640 
gagactattc agggcctggg tgctgcctca 2700 
ctgttgagca ccgcccaaga ggcagacccc 2760 
ggcgtgctgg cagagcatgg gggccaccct 2820 
ctcctttttc ccctcctggc gcgggagcga 2880 
gcacttgccc gcctgttgat ggccagtccc 2940 
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accaggaaac cagagcccca ggtgctggct 
gacttggagg agtgggtcac cattgggcgc 
gaccaggtta tagatgtggc tcccgagctt 
aacaagatcc caccagacac caaggccgca 
cagcacaccg acagctttca agcagctctg 
ctccaggctg tactgggcct ctcctagact 
gcctgcccag gccttaagac cacctctcag 
tgagactcat acccatttgg agccagcccc 

<210> 35 
<211> 4458 
<212> DNA 

<213> Homo sapiens 



gccctactgc atgccctgcc actgaaggag 3000 

ctcttcagct tcctgtacca gagcagccct 3060 

ctgcgtatct gcagcctcat tctggctgac 3X20 

ctgttgctgc tcctgacgtt cctggccaaa 3180 

ggctcactgc ctgttgacaa ggctcaggag 3240 

gcaggctgca gccagtccag agagaataga 3300 

cccagttcag ttctgcctta ccaaagattc 3360 

acttgctgcc 3400 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3552033CB1 



<400> 35 

gttgttgcta gtatgatata gatcatgtgt 
tagtgtcttg cttgttgtat ctatattgtt 
tatcgtttgc cccgtttaat taccctccat 
tgggatgcgt cttcaatcac cgcttttata 
gggggctctt tcggcaccac cgcaagaacc 
tttaaaaaga agatgtgccc aacagtgaaa 
ggcttctgat aggcccagca ttggccctca 
aaaatgaaag tcgcctctcc cgaaaggaca 
acactcaact cagccctacg gatgcttttg 
ccaacaaagc catgtatg.tg cgagtatctt 
tgatgaccaa ggaatggcag ttggagcttc 
tgcagaactt tgaactccag ccaaaactca 
cagctatgac aactggagcg tggatattca 
atgttggcga tgccttgaag gatcatgcct 
gtattgcccc ctggggaatt gtggaaaacc 
ggccatacca gaccatgtcc aatcccatga 
cccacttcat tctggctgac aacgggacca 
gaagacaact ggaaaagcat atttcactcc 
ttcctgtggt ggcactcata gtggaaggag 
accttcgaga cacccctccc gtgccagtgg 
acatcctggc ctttgggcat aaatactcag 
gggaccagct gttggtgact atacagaaga 
atctgttcat catcctcatg gagtgcatga 
tgggatcaga aggacaccag gacattgatt 
ccaatgcctc ggccccagac caactgagct 
ctcgcagcca gatctttatt tacgggcaac 
tgttggatgc cttagttctg gacagagtgg 
taagcatgca ccgttttctc accatctcca 
ggccctcaaa tacattgtac cacttggtca 
actacagaat cagcctgatt gacatcggcc 
atcgctgcaa ctacacgcgc aagcgcttcc 
agaggcccaa agccttgaaa ctgctgggaa 
gaaagacaac caagaaacgt gaagaagagg 
accacttccc cttccctttc catgagctca 
agatggccct gttcttctgg cagcacggtg 
gcaagctctg caaagccatg gctcatgagg 
cccaggagct gaatcacaat tccagagact 
agtcctacaa gcaggacgaa cagctggcca 
ggagcaacgc cacgtgcctg cagcttgccg 
acacgtgcag ccagatgctg ctcaccgaca 
actcaggcct caaggtaatt ctgggaattc 
tcaagaacaa agacgacatg ccctatatgt 
aggaggcaga agaaccagag aagcccacaa 
tagcaatgtt gggacgaaac aacggggagt 



acgtctatgt tagtgctatg tacgttgtta 60 
attgtttcgt tttgtgtgtg tatatattat 120 
ggggccccga gttttatttt ccccccttgg 180 
ccctgatatt cggttcaagg gactttctcc 240 
ggtctcagct caaatcccgg aatgaaggcc 300 
ccccgccccc aggccccata ggtttccttg 360 
cccccagtat ctccggcttc agaattgaga 420 
tccagtctga aaagtggtcc atcagcaaac 480 
ggaccattga gttccaagga ggtggccatt 540 
ttgatacaaa acctgatctc ctcttacacc 600 
ccaagcttct catctctgtc catgggggcc 660 
agcaagtctt tgggaaaggg ctcatcaaag 720 
ctggaggggt taacacaggt gttattcgtc 780 
ctaagtctcg aggaaagata tgcaccatag 840 
aggaggacct cattggaaga gatgttgtcc 900 
gcaagctcac tgttctcaac agcatgcatt 960 
ctggaaaata tggagcagag gtgaaacttc 1020 
agaagataaa cacaagaatc ggtcaaggtg 1080 
gacccaatgt gatctcgatt gttttggagt 1140 
ttgtctgtga tgggagtgga cgggcatcgg 1200 
aagaaggcgg actgataaat gaatctttga 1260 
ctttcacata cactcgaacc caagctcagc 1320 
agaagaagga attgattacg gtatttcgga 1380 
tggctatcct gacagcttta ctcaaaggag 1440 
tagctttagc ctggaacaga gtcgacatcg 1500 
agtggccggt gggatctctg gagcaagcca 1560 
attttgtgaa attactcata gagaatggag 1620 
gactagagga attgtacaat acgagacatg 1680 
gggatgtcaa aaaggggaac ctgcccccag 1740 
tggtgatcga gtacctgatg ggcggggctt 1800 
ggaccctcta ccacaacctc ttcggcccca 1860 
tggaggatga tattcccttg aggcgaggaa 192 0 
tggacattga cttggatgat cctgagatca 1980 
tggtgtgggc tgttctcatg aagcggcaga 2040 
aggaggccat ggccaaggcc ctggtggcct 2100 
cctctgagaa cgacatggtt gacgacattt 2160 
ttggccagct ggctgtggag ctcctggacc 2220 
tgaaactgct gacgtatgag ctgaagaact 2280 
tggctgccaa acaccgcgac ttcatcgcgc 2340 
tgtggatggg ccggctccgc atgcgcaaga 2400 
tacttcctcc ttcaattctc agcttggagt 2460 
ctcaggccca ggaaatccac ctccaagaga 2520 
aggaaaaaga ggaagaggac atggagctca 2580 
cctccaggaa gaaggatgaa gaggaagttc 2640 
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agagcgagca ccggttaatc cccctcggca 
tcgtgaagtt ctggttctac acactggcgt 
tcgtgttagt gaagatggaa cgctggccgc 
ttttcaccct gggaatagaa aagatgagag 
tacagaaagt gaaggtatgg ctgcaggagc 
ttctgttttc tgtcggaatg atccttcgtc 
gggtcatcta ctgcgtgaac atcatttact 
tgaacaagta tttgggcccg tatgtaatga 
actttgtcat cattatgctg gtggttctga 
tttttcccaa tgaggagcca tcatggaaac 
ggatgattta tggggaagtg tttgcggacc 
cccgagagga tggtaaaata atccagctgc 
cggccatcat ggcctgctac ctcttagtgg 
ctgtctttaa caatacattt tttgaagtaa 
agaggtatca gctcatcatg actttccatg 
tcttcagcca catgaccatg atattccagc 
gcgacccgga tgaaagggac tacggcctga 
aagtacatga ctttgaagag caatgcatag 
tcaactcatc taatgatgag aggatacggg 
tgcggctgga ggaagtcaac gagagagagc 
acatccggct ggcgcagctg gaagacctta 
tgacaggtct ggagcgggcc gagtccaaca 
cggacgccgc ctacattgtc cgtcagagca 
agctccaaga gagtatagac cctgcagagc 
agcagttcta cctaacgtcc cacatccttc 
taaagaaaaa acaaggtaac agtcatagtt 
ccacaactgt ttatcctttt ttgaagaaaa 
aatcgaaatg aaagaagagt tgatgttaaa 
agtgggcgtt caagtctagt cgagcaagct 
ttttaattta ttttgtatct cacctgtcat 
tgaaccttaa aaaaaaaa 



gaaaaatcta tgaattctac aatgcaccca 2700 
atatcggata cctgatgctc ttcaactata 2760 
ccacccagga atggatcgta atctcctata 2820 
agattctgat gtcagagcca gggaagttgc 2880 
actggaatgt cacggacctc atcgccatcc 2940 
tccaagacca gcccttcagg agtgacggga 3000 
ggtatatccg tctcctagac atcttcggcg 3060 
tgattggaaa aatgatgata gacatgatgt 3120 
tgagctttgg ggtcgccagg caagccatcc 3180 
tggccaagaa catcttctac atgccctatt 3240 
agatagaccc tccctgtgga cagaatgaga 3300 
ctccctgcaa gacaggagct tggatcgtgc 3360 
caaacatctt gctggtcaac ctcctcattg 3420 
aatcgatatc caaccaagtc tggaagtttc 3480 
aaaggccagt tctgccccca ccactgatca 3540 
acctgtgctg ccgatggagg aaacacgaga 3600 
aactcttcat aaccgatgat gagctcaaga 3660 
aagaatactt cagagaaaag gatgatcggt 3720 
tgacttcaga aagggtggag aacatgtcta 3780 
actccatgaa ggcttcactc cagaccgtgg 3840 
tcgggcgcat ggccacggcc ctggagcgcc 3900 
aaatccgctc gaggacctcg tcagactgca 3960 
gcttcaacag ccaggaaggg aacaccttca 4020 
atcctcttta ttcagtataa gccggcagca 4080 
tcatgccaac acttctgtaa ttgatcatta 4140 
cacctgtctc ttatctattc acttctggtg 4200 
taagggaaca gaaatgcctt tttgtattgc 4260 
aaaacaaaag tcaagtgatt tattatatac 4320 
caggagaatg taattaaata attttatatt 4380 
ggatgaattc attcactgaa tatgtaatat 4440 

4458 



<210> 36 
<211> 2722 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 4778139CB1 

<220> 

<221> unsure 
<222> 2609 

<223> a, t, c, g, or other 



<400> 36 

atggagatcg ctggagagaa 
agcctggctg gaggccggca 
aacagtaaaa gagaggctac 
agtatagtgg tttctggggt 
cttctggtcc cctctaatct 
tccagaccct ctgtgccaga 
gttcatctga ctgggcagac 
gaaaagcacg agcctattgc 
cccactttga gttgcacacc 
atcatgcaaa ctggaggtgg 
gtgtttagca tcaagaaaaa 
tctcggagtg aacagcacca 
agatcggccc agagtagtga 
ttttggatca atcctctccg 
cataaggagg agaaattgat 
atagggcagg gagcatgggt 



gaaataccag tattatagca 
tcccccaaga gtgcagaggg 
aaatgtaaga aatgaccagg 
gtctcctaac aggcaagcac 
aaagcgagtt cctttttcag 
tgtgctggca actgaacagg 
cccagttgtc tctgactggg 
tcacttactg gatggacaag 
tgaacccatg acaatgagct 
atatgtaaac tgggcatttt 
gtggcaaacc tgcttgccct 
gaagcaggcc caggacagct 
atgctcagag gtgggaccat 
cagatacagg cccttcgcta 
gaagatctgt aagattaaaa 
caaagcgaaa atgctaacca 



tgccctcttc tttgctgagg 60 
gggcacttct tgagattaca 120 
aaaggcaaga aacacaaagt 180 
actcaaagta tggccagttt 240 
cagaaactgt cttgcctctg 300 
acatccagac tgaggttctt 360 
catcagtgga tgaacccaag 420 
acaaggcaga gcaagtgcta 480 
cccctctttc ccaagccaag 540 
cagaaggtga tgaaactggt 600 
ccacttgtga cagtgattcc 660 
ccctatctga taactcaaca 720 
ggcttcagcc aaacacatcc 780 
ggagtcatag ttttagattc 840 
atctttcagg ctcttcagaa 900 
aagacaggag actgtcaaag 960 
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aaaaagaaga atacteaagg actccaggtg ccaatcataa cagtcaatgc ctgctctcag 1020 

agtgaccagt tgaatccaga gccaggagaa aacagcatct ctgaagagga gtacagcaag 1080 

aactggttca cagtgtccaa atttagtcac acaggtgtag aaccttacat acatcagaaa 1140 

atgaaaacta aagaaattgg acaatgtgct atacaaatca gtgattacct aaagcagtct 1200 

caagagtctg ctcaggatct cagcaaaaac tctttgtgga attccaggag caccaacctc 1260 

aataggaact ccctgtcctc tctaatttca gagatctcag cctccttaaa aagccctcaa 1320 

gagcctcacc atcattattc accttcactt ctctttgcag caggagaaga aataactgtc 1380 

tacaggttgg aggagagttc ccctttaaac cttgataaaa gcatgtcctc ttggtctcag 1440 

cgtgggagag cggcaatgat ccaggtattg tcccgagagg agatggatgg gggcctccgt 1500 

aaagctatga gagtcgtcag cacttggtct gaggatgaca ttctcaagcc gggacaagtt 1560 

ttcattgtca agtcctttct tcctgaggtt gtgcggacat ggcataaaat cttccaggag 1620 

agcactgtgc ttcatctttg cctcagggaa attcaacaac aaagagctgc tcaaaaattg 1680 

atctatacct tcaaccaagt gaaaccacaa accataccct acacaccaag gttcctggaa 1740 

gttttcttaa tctactgcca ttcagccaac cagtggttga ccattgagaa gtatatgaca 1800 

ggggagttcc ggaagtataa caacaacaat ggtgatgaaa tcacccccac caacaccctg 1860 

gaggagctga tgttggcttt ctctcactgg acctatgagt acactcgggg agagctgctg 1920 

gttttagatt tgcaaggtgt tggagaaaat ttgacagatc catctgttat aaaacctgaa 1980 

gtcaaacaat caagaggaat ggtgtttgga ccggccaatt tgggggaaga tgcaattaga 2040 

aacttcattg caaaacatca ttggaactcc tgctgccgga agctcaaact cccggattta 2100 

aaaagaaatg actattcccc tgaaaggata aattccacct ttggacttga gataaaaata 2160 

gaatcagctg aggagcctcc agcaagggag acgggtagaa attccccaga agatgatatg 2220 

caactataaa aagggaggag caagaagatc ccagtgcttg ccctgcctgc caggaactct 2280 

gtgataacat agattgatca acgtgatgtt gattacatca gcgtctcctt gggacacgcc 2340 

ttctgagcct cacatctcct tctgttcaaa ggcctcattg gtatatgatc aatgggttct 2400 

cctagacact gacctctgtc cagggcactt tgcagctcca tcctcaagtt ccacacgaag 2460 

atgcttggat gagtcagctg ggaatattgt tcttgtgtac ctcattgctt tagctggtca 2520 

cttggaactt tggagcagaa tcctgcacat taaaggatgg ggttgggggg gatacattta 2580 

ttttattttc tcactatgta tgcagactng accccctact actatttgtc acctcaccca 2640 

cagattgtat ttatgtctat atatatgttt cataaaaagc ttaccaaacc caaaaaaaaa 2700 

aaaaaaaaaa aaaaaaaaaa aa 2722 

<210> 37 

<211> 1924 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4787433CB1 

<400> 37 

cggcgcccag tgcacggccg caccatgggg tcccgccact tcgaggggat ttatgaccac 60 

gtggggcact. tcggcagatt ccagagagtc ctctatttca tatgtgcctt ccagaacatc 120 

tcttgtggta ttcactactt ggcttctgtg ttcatgggag tcacccctca tcatgtctgc 180 

aggcccccag gcaatgtgag tcaggttgtt ttccataatc actctaattg gagtttggag 240 

gacaccgggg ccctgttgtc ttcaggccag aaagattatg ttacggtgca gttgcagaat 300 

ggtgagatct gggagctctc aaggtgtagc aggaataaga gggagaacac atcgagtttg 360 

ggctatgaat acactggcag taagaaagag tttccttgtg tggatggcta catatatgac 420 

cagaacacat ggaaaagcac tgcggtgacc cagtggaacc tggtctgtga ccgaaaatgg 480 

cttgcaatgc tgatccagcc cctatttatg tttggagtcc tactgggatc ggtgactttt 540 

ggctactttt ctgacaggct aggacgccgg gtggtcttgt gggccacaag cagtagcatg 600 

tttttgtttg gaatagcagc ggcgtttgca gttgattatt acaccttcat ggctgctcgc 660 

ttttttcttg ccatggttgc aagtggctat cttgtggtgg ggtttgtcta tgtgatggaa 720 

ttcattggca tgaagtctcg gacatgggcg tctgtccatt tgcattcctt ttttgcagtt 780 

ggaaccctgc tggtggcttt gacaggatac ttggtcagga cctggtggct ttaccagatg 840 

atcctctcca cagtgactgt cccctttatc ctgtgctgtt gggtgctccc agagacacct 900 

ttttggcttc tctcagaggg acgatatgaa gaagcacaaa aaatagttga catcatggcc 960 

aagtggaaca gggcaagctc ctgtaaactg tcagaacttt tatcactgga cctacaaggt 1020 

cctgttagta atagccccac tgaagttcag aagcacaacc tatcatatct gttttataac 1080 

tggagcatta cgaaaaggac acttaccgtt tggctaatct ggttcactgg aagtttggga 1140 

ttctactcgt tttccttgaa ttctgttaac ttaggaggca atgaatactt aaacctcttc 1200 

ctcctgggtg tagtggaaat tcccgcctac accttcgtgt gcatcgccat ggacaaggtc 1260 

gggaggagaa cagtcctggc ctactctctt ttctgcagtg cactggcctg tggtgtcgtt 1320 
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atggtgatcc cccagaaaca ttatattttg 
gccatcgggg cagcatttgg cctcatttat 
gtaagatcgc tggctgtggg aagcggcagc 
ccgttctctg tggacctcag cagcatttgg 
atggccctcc tgagtggagt gttaacacta 
gcaactactt gggaggaggc tgcaaaactg 
ttacttctca caactaataa tagtgggctg 
tctggtcttg gtgaataaat gtgccatgcc 
ctaatgcctt tgtattagag gaatcttatt 
tttaataaat tttgtaagaa aattttaaag 
gatg 



ggtgtggtga cagctatggt tggaaaattt 1380 
ctttatacag ctgagctgta tccaaccatt 1440 
atggtgtgtc gcctggccag catcctggcg 1500 
atcttcatac cacagttgtt tgttgggact 1560 
aagcttccag aaacccttgg gaaacggcta 1620 
gagtcagaga atgaaagcaa gtcaagcaaa 1680 
gaaaaaacgg aagcgattac ccccagggat 1740 
tgctgtctag cacctgaaat attatttacc 1800 
ctcatctccc atatgttgtt tgtatgtctt 1860 
caaatatgtt ataaaagaaa taaaaactaa 1920 

1924 



<210> 38 

<211> 1797 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7483598CB1 



<400> 38 

agcatccccg tcccggagga aaaaacattt 
aggctgccac agcctagaga tcttggggct 
ggggcgcccg tggaactggc tccctgcagc 
gcgaggtggc ggaggaggct ggccgggtgc 
ctgctgagga ccaccgctca gccatgggct 
agagagattt agatgacaga gaaacccttg 
gtcagtctgc tgctcttttt aatgttgtca 
tgccttattc aatgaagcaa gctgggtttc 
catatgttac agacttttcc cttgttttat 
atacctacca gtctttggtc aataaaactt 
ttcttcagtt tttgtatcct tttatagcaa 
ctttgagcaa agtttttcaa agaatcccag 
gccacttcat tattggactt tccacagtta 
atatagcaaa gcttggaaag gtctccctca 
gaattgtaat ggcaagggca atttcactgg 
gggtatttgc aaagcccaat gccattcaag 
gccaccataa ctccttctta gtttacagtt 
cccgccttat ccatatgtcc atcgtgattt 
gtggatactt gacatttact ggcttcaccc 
atgatgacct ggtaacattt ggaagatttt 
ctatggaatg ctttgtgaca agagaggtaa 
catcggtttt ccacattgtt gtaacagtga 
tgctgattga ttgcctcggg atagttctag 
tcatttttat cattccatca gcctgttatc 
ccgataagat tatgtcttgt gtcatgcttc 
tcgtcatggc tattacaaat actcaagact 
ttcctgacaa tttctctctc acaaatacct 
tttctacttt aaatattagt atctttcaat 
tcatagactt taaaacacat aacatttacg 
aattattatt ttggctttta tcaagacttg 



gtctggcgaa ctccgggtgg aaagcgcccc 60 
tcagcccctc gcggcctgcc gagggagcag 120 
tctgcggcta cacgcggacc tcggctgtgt 180 
gaatccgtac ccagccccag catcttccac 240 
accagaggca ggagcctgtc atcccgccgc 300 
tttctgaaca tgagtataaa gagaaaacct 360 
actcgattat aggatctggt ataataggat 420 
ctttgggaat attgctttta ttctgggttt 480 
tgataaaagg aggggccctc tctggaacag 540 
tcggctttcc agggtatctg ctcctctctg 600 
tgataagtta caatataata gctggagata 660 
gagttgatcc tgaaaacgtg tttattggtc 720 
cctttactct gcctttatcc ttgtaccgaa 780 
tctctacagg tttaacaact ctgattcttg 840 
gtccacacat accaaaaaca gaagacgctt 900 
cggtcggggt tatgtctttt gcatttattt 960 
ctctagaaga acccacagta gctaagtggt 1020 
ctgtatttat ctgtatattc tttgctacat 1080 
aaggggactt atttgaaaat tactgcagaa 1140 
gttatggtgt cactgtcatt ttgacatacc 1200 
ttgccaatgt gttttttggt gggaatcttt 1260 
tggtcatcac tgtagccacg cttgtgtcat 1320 
aactcaatgg tgtgctctgt gcaactcccc 1380 
tgaaactgtc tgaagaacca aggacacact 1440 
ccattggtgc tgtggtgatg gtttttggat 1500 
gcacccatgg gcaggaaatg ttctactgct 1560 
cagagtctca tgttcagcag acaacacaac 1620 
gagttgactg ctttaaaaat atgtatgttt 1680 
cttgctttag tctgtattta tgttatataa 1740 
gcttttatga gtagtgcaat ataaaaa 1797 



<210> 39 

<211> 3277 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7484823CB1 



<400> 39 
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atgggggaag gagatagtgg gttgaggagc 
agtagggaaa aacagaagcg caatatggag 
cacaaattaa ccttggaaga gctgagcacc 
agccaccaaa gggcaaagga aatcctgact 
cccaccactc cagaatgggt caaattctgt 
ctatggactg gggccattct ctgctttgtg 
gagcctacca aagacaacct ctacctgagc 
ggctgcttct cctattatca ggaggccaag 
atggtgcctc agcaagctct ggtaattcga 
gaggtggtgt tgggagacct ggtggaaatc 
cggcttatct ctgcacaagg atgtaaggtg 
ccccagagcc gctcccctga cttcacccat 
ttcttttcca ccaactgtgt ggaaggaacc 
tccacagtga tgggcagaat tgcctccctg 
atcgctgctg agatcgaaca cttcatccat 
gtcacttttt ttgcgctctc acttctcttg 
ctcattggca tcattgtggc caatgtgcct 
ctgaccctca cagccaagcg catggcgcgg 
gtggagacgc tgggctccac gtccaccatc 
aaccgcatga ccgtcgccca catgtggttt 
gaagaacaga ctggaaaaac atttaccaag 
atcgctggcc tctgcaaccg ggctgacttt 
aagagggcca caacaggtga tgcttccgag 
tacagctctg tggcggagat gagagagaaa 
tctaccaaca agtaccagat gtccatccac 
ctgatgatga agggtgctcc ggagaggatc 
gggcaggagt actcaatgaa cgatgaaatg 
ctgggaggtc tgggggaacg tgtgctaggc 
tccaagggat tcccatttaa tacagatgaa 
gtgggcctca tatccatgat tgaccctccc 
tgtcgcagtg caggaattaa ggtgatcatg 
gccattgcca agggtgtggg catcatctca 
gcccggctta agatccctat cagcaaggtc 
catggtgcag aactgaagga catacagtcc 
cctgagatcg tgtttgctcg gacctcccct 
cagaggctgg gagccgttgt ggccgtgaca 
aagaaggctg acattggcat tgccatgggc 
gccgacatga tcctgctgga tgacaacttt 
cgcctgatct ttgacaacct gaagaaatcc 
gagatcacgc ccttcctgat gttcatcatc 
accatcctct gcattgatct cggcactgac 
tcagctgaaa gcgacatcat gaagaggctt 
aaccaccgtc tcattggcat ggcctatgga 
ttctttacct actttgtaat cctggctgag 
atccgcctcc actgggaaga taaatacttg 
tggacctatg agcaacgaaa agttgtggag 
atcgtggttg tgcagtgggc ggatctcatc 
cagcagggca tgagaaacaa agtcttaata 
gcatttctgt cctacactcc aggcatggac 
acctggtggc tctgtgccat tccctacagt 
aaactcctca tccgtcagca cccggatggc 
cagcagatga agagcttcat gtgacacagg 
attataagtt tgacacaaca tctgagacac 
ggcaatcctg ggctggcttg agggaatcat 
cccagcctgc atctagctgg agccccgcag 

<210> 40 
<211> 2773 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 



ccaagaggcc cacaggaagg tgggaaatgc 60 
gaactgaaga aggaagtggt catggatgat 120 
aagtactccg tggacctgac aaagggccat 180 
cgaggtggac ccaatactgt taccccaccc 240 
aagcaactgt tcggaggctt ctccctccta 300 
gcctacagca tccagatata tttcaatgag 360 
atcgtactgt ccgtcgtggt catcgtcact 420 
agctccaaga tcatggagtc ttttaagaac 480 
ggaggagaga agatgcaaat taatgtacaa 540 
aagggtggag accgagtccc tgctgacctc 600 
gacaactcat ccttgactgg ggagtcagaa 660 
gagaaccctc tggagacccg aaacatctgc 720 
gcccggggta ttgtgattgc tacgggagac 780 
acgtcaggcc tggcggttgg ccagacacct 840 
ctgatcactg tggtggccgt cttccttggt 900 
ggctatggtt ggctggaggc tatcattttt 960 
gaggggctgt tggccacagt cactgtgtgc 1020 
aagaactgcc tggtgaagaa cctggaggcg 1080 
tgctcagaca agacgggcac cctcacccag 1140 
gatatgaccg tgtatgaggc cgacaccact 1200 
agctctgata cctggtttat gctggcccga 1260 
aaggctaatc aggagatcct gcccattgct 1320 
tcagccctcc tcaagttcat cgagcagtct 1380 
aaccccaagg tggcagaggt tccctttaat 1440 
cttcgggagg acagctccca gacccacgta 1500 
ttggagtttt gttctacctt tcttctgaat 1560 
aaggaagcct tccaaaatgc ctatttagaa 1620 
ttctgcttct tgaatctgcc tagcagcttc 1680 
ataaatttcc ccatggacaa cctttgtttt 1740 
cgagctgcag tgcctgatgc tgtgagcaag 1800 
gtaacaggag atcatcccat tacagctaag 1860 
gaaggcactg agacggcaga ggaagtcgct 1920 
gatgccagtg ctgccaaagc cattgtggtg 1980 
aagcagcttg atcagatcct ccagaaccac 2040 
cagcagaagc tcatcattgt cgagggatgt 2100 
ggtgacgggg tgaacgactc ccctgcgctg 2160 
atctctggct ctgacgtctc taagcaggca 2220 
gcctccatcg tcacgggggt ggaggagggc 2280 
atcatgtaca ccctgaccag caacatcccc 2340 
ctcggtatac ccctgcctct gggaaccata 2400 
atggtccctg ccatctcctt ggcttatgag 2460 
ccaaggaacc caaagacgga taatctggtg 2520 
cagattggga tgatccaggc tctggctgga 2580 
aatggtttta ggcctgttga tctgctgggc 2640 
aatgacctgg aggacagcta cggacagcag 2700 
ttcacatgcc aaacggcctt ttttgtcacc 2760 
atctccaaga ctcgccgcaa ctcacttttc 2820 
tttgggatcc tggaggagac actcttggct 2880 
gtggccctgc gaatgtaccc actcaagata 2940 
attctcatct tcgtctatga tgaaatcaga 3000 
tgggtggaaa gggagacgta ctactaaact 3060 
ggtgttgtga gagctgggat ggggccagag 3120 
taggatgaat tatcttggat gagaaagatg 3180 
gggcagagga tgaggtgggc tgaagggaag 3240 
ggagggg 3277 
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<223> Incyte ID No: 143935CB1 



<400> 40 

atggaagaga tgtcaggaga aagtgtggtg 
accacttcct tcaagggcac gagccccagc 
gccctctact ataccaccat gcagacgctg 
ttcagcgggc gcatggaagt gctcaccgac 
gggaagcact ttggtacgat actcaactac 
agccgccggg agatcgagga gctgctagca 
gtggaagagt gccaggcggc cctacaaaac 
cctgtgatca cctcatccaa ggaagaacaa 
gtgaagttgc tctacaacag aagtaacaac 
aatatgttga aaaacattga actgtttgat 
ctgttcataa aggatgttat tggggatgaa 
cggaagattg ctgaagtctg ttgtacctcc 
aaggtggagt ttcccgaagc ccggatttat 
gcccaggatg gccggggacc tgacaatgcg 
cgctcccacc acctggacga ggacgaggag 
cacatcaagc gccctgatga ccgggcccac 
cgccctcctc tcaccgcccc cactccctgc 
cgggcccctt ctgcttccct tggagcctgg 
tttggtattg cttgacaagg caaattgatt 
tgaacaagct gtgtctaaga tctctacttt 
gctttctcgg ttctcagagg aaaagtatga 
tttgcaatat ttatttttgt gggtgtcgac 
cttaagggtt cagtttgaca attctgagag 
tctgagctcc ctgcttccta tttcataatc 
ctgtgtgttt ggcccgggca caatccccac 
gctttgggcc tagagcttgt tgataattgc 
agcgtctaaa tcgttgagac cagtgcaact 
ccttttgcca ccttgggctg gtctttggcc 
taggagcaga actttccatg gcagtaagtg 
tccagccccg tccagttgtt ctcctgattg 
ccctgtgaca ggcatagctc ataaactacc 
ccttccccat gaagctgggc taactttcta 
catacccttt gtcctcccag cctggcatcc 
agggagcttc cctgctataa acacagaccc 
gtagagtggc cagcagtgaa caggttgagg 
ggtttgtgtc tgtgttcatg tttgtttaag 
gcttgtgggg cacaggtggc ggcccctgct 
gacgggtgtt ctcagtgacc tacctcccag 
gagtcgtgac tgacaggggt ggttgagact 
gaatgtgcgt caggtgatgg atgggtttgt 
gtattaacat gggcagcctt tgacacatgt 
gtttttacag cttagacttt gtacttactg 
ttgtatccaa cagcaaagtc tacaataaaa 
tcgtgtattg ggcagatgct ttttaaactg 
tggattttat taagtgctaa ggaaagaggg 
tgtttaagtg ttaaacacca accctgtctt 
aagtgaatgt tgg 

<210> 41 

<211> 2966 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5923789CB1 

<400> 41 

tttttccaag tgagagactg ccccatatgt 
cagttgtatt ggtgggaacc tgccgtttgc 



agctcagcgg tgccagcggc tgctacccgc 60 
tccaaatacg tgaagctgaa tgtgggtgga 120 
accaagcagg acaccatgct gaaggccatg 180 
agtgaaggct ggatcctcat tgaccgctgt 240 
cttcgagacg gggcggtgcc tttacccgag 300 
gaagccaagt actacctagt ccaaggcctg 360 
aaagatactt atgagccttt ctgcaaggtc 420 
aaacttatag cgacttcaaa taagccagcc 480 
aaatactcat ataccagcaa ttctgacgac 540 
aagctgtctc tgcgctttaa cggaagggtc 600 
atctgctgct ggtcctttta tggtcagggc 660 
atcgtctatg ccactgagaa gaaacagacc 720 
gaggagaccc tgaacatttt gctgtatgag 780 
ctcctggagg ccacaggcgg ggcggcgggg 840 
cgggagcgga tcgagcgcgt gcggaggatc 900 
ctccaccagt gagcaggcaa gagaccgagc 960 
cgtgctacac ccagatcctg tgcaggctgc 1020 
agatactttt gtaacaagcc agatgattat 1080 
gtcttgaccc aggcgtatga cccctgtcgt 1140 
tcatgagaat ctgagactct ttggagccag 1200 
atgagtgtga agtgtatgtg agaacttttg 1260 
ttcctatgtg ggctttttgg gtgacactcc 1320 
ttgtcctgca gttggaggcc accagaggta 1380 
ctccagcccc agcaggtcca ctcctggttc 1440 
tgctttgcta gacgtgcttt ctgccatgtg 1500 
agcttgtggc agtggaaata tggctgaatg 1560 
ttgggtgcaa ggctttgttt agggatcaag 1620 
tggtgctcac tgggacccca tatgtctgcg 1680 
tccagctctg tttctggttc tttccccaac 1740 
acccgactcc actccaggaa ggccatctga 1800 
cctccctggg atcccgctcc tcttcagcct 1860 
agtcattttg cttagaaatt cagtgtggcc 1920 
aggcagggac accctcacac caccagcccc 1980 
ccttgtcttt gcctctgatt tttacacagt 2040 
atgtgcgggt agatagataa ctttgggtct 2100 
ggatatgtgt gactgtgggt ggggacgtgt 2160 
ggagcccggc tgggcgcagc gcctatgtag 2220 
gctcctctgc acctgcaaag gaacaggagt 2280 
agactaggta gagtagttac caggagatgt 2340 
caagggaatc gttaccgttt tataccaaag 2400 
attccaaaaa cgagtttata ttttcaaacg 2460 
ccctgcctgt gacagttgta tgccttcatt 2520 
ctttaaaaca atcatgactg aatgtcaaaa 2580 
tcgtgtgaga aacttttata ttaggccatt 2640 
cttacaaaat gtttcgtaaa tattttatac 2700 
tcttttgggt tgagcttttt tagaaagtcg 2760 

2773 



tagtgcaata tgtcactgga ggtgaagcat 60 
tgtccccttt ttcctcatgc cttttcctgc 120 
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ctctctgatc ttttctaggt ctctggccta tcaggaggac aactggtgct gcaatagaag 180 
ccagtggcta agtctcgtgt atggcgtggt taaggttgca gcctctcacc tctgccttcc 240 
tccattttgg gctggttacc tttgtgctct tcctgaatgg tcttcgagca gaggctggtg 300 
gctcagggga cgtgccaagc acagggcaga acaatgagtc ctgttcaggg tcatcggact 360 
gcaaggaggg tgtcatcctg ccaatctggt acccggagaa cccttccctt ggggacaaga 420 
ttgccagggt cattgtctat tttgtggccc tgatatacat gttccttggg gtgtccatca 480 
ttgctgaccg cttcatggca tctattgaag tcatcacctc tcaagagagg gaggtgacaa 540 
ttaagaaacc caatggagaa accagcacaa ccactattcg ggtctggaat gaaactgtct 600 
ccaacctgac ccttatggcc ctgggttcct ctgctcctga gatactcctc tctttaattg 660 
aggtgtgtgg tcatgggttc attgctggtg atctgggacc ttctaccatt gtagggagtg 720 
cagccttcaa catgttcatc atcattggca tctgtgtcta cgtgatccca gacggagaga 780 
ctcgcaagat caagcatcta cgagtcttct tcatcaccgc tgcttggagt atctttgcct 840 
acatctggct ctatatgatt ctggcagtct tctcccctgg tgtggtccag gtttgggaag 900 
gcctcctcac tctcttcttc tttccagtgt gtgtccttct ggcctgggtg gcagataaac 960 
gactgctctt ctacaaatac atgcacaaaa agtaccgcac agacaaacac cgaggaatta 1020 
tcatagagac agagggtgac caccctaagg gcattgagat ggatgggaaa atgatgaatt 1080 
cccattttct agatgggaac ctggtgcccc tggaagggaa ggaagtggat gagtcccgca 1140 
gagagatgat ccggattctc aaggatctga agcaaaaaca cccagagaag gacttagatc 1200 
agctggtgga gatggccaat tactatgctc tttcccacca acagaagagc cgtgccttct 1260 
accgtatcca agccactcgt atgatgactg gtgcaggcaa tatcctgaag aaacatgcag 1320 
cagaacaagc caagaaggcc tccagcatga gcgaggtgca caccgatgag cctgaggact 1380 
ttatttccaa ggtcttcttt gacccatgtt cttaccagtg cctggagaac tgtggggctg 1440 
tactcctgac agtggtgagg aaagggggag acatgtcaaa gaccatgtat gtggactaca 1500 
aaacagagga tggttctgcc aatgcagggg ctgactatga gttcacagag ggcacggtgg 1560 
ttctgaagcc aggagagacc cagaaggagt tctccgtggg cataattgat gacgacattt 1620 
ttgaggagga tgaacacttc tttgtaaggt tgagcaatgt ccgcatagag gaggagcagc 1680 
cagaggaggg gatgcctcca gcaatattca acagtcttcc cttgcctcgg gctgtcctag 1740 
cctccccttg tgtggccaca gttaccatct tggatgatga ccatgcaggc atcttcactt 1800 
ttgaatgtga tactattcat gtcagtgaga gtattggtgt tatggaggtc aaggttctgc 1860 
ggacatcagg tgcccggggt acagtcatcg tcccctttag gacagtagaa gggacagcca 1920 
agggtggcgg tgaggacttt gaagacacat atggggagtt ggaattcaag aatgatgaaa 1980 
ctgtgaaaac cataagggtt aaaatagtag atgaggagga atacgaaagg caagagaatt 2040 
tcttcattgc ccttggtgaa ccgaaatgga tggaacgtgg aatatcagat gtgacagaca 2100 
ggaagctgac tatggaagaa gaggaggcca agaggatagc agagatggga aagccagtat 2160 
tgggtgaaca ccccaaacta gaagtcatca ttgaagagtc ctatgagttc aagactacgg 2220 
tggacaaact gatcaagaag acaaacctgg ccttggttgt ggggacccat tcctggaggg 2280 
accagttcat ggaggccatc accgtcagtg cagcagggga tgaggatgag gatgaatccg 2340 
gggaggagag gctgccctcc tgctttgact acgtcatgca cttcctgact gtcttctgga 2400 
aggtgctgtt tgcctgtgtg ccccccacag agtactgcca cggctgggcc tgcttcgccg 2460 
tctccatcct catcattggc atgctcaccg ccatcattgg ggacctggcc tcgcacttcg 2520 
gctgcaccat tggtctcaaa gattcagtca cagctgttgt tttcgtggca tttggcacct 2580 
ctgtcccaga tacgtttgcc agcaaagctg ctgccctcca ggatgtatat gcagacgcct 2640 
ccattggcaa cgtgacgggc agcaacgccg tcaatgtctt cctgggcatc ggcctggcct 2700 
ggtccgtggc cgccatctac tgggctctgc agggacagga gttccacgtg tcggccggca 2760 
cactggcctt ctccgtcacc ctcttcacca tctttgcatt tgtctgcatc agcgtgctct 2820 
tgtaccgaag gcggccgcac ctgggagggg agcttggtgg cccccgtggc tgcaagctcg 2880 
ccacaacatg gctctttgtg agcctgtggc tcctctacat actctttgcc acactagagg 2940 
cctattgcta catcaagggg ttctaa 2966 

<210> 42 
<211> 4404 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 6046484CB1 

<400> 42 

atgtatatcc gtgtatccta tgacaccaag ccagactcac tgctccatct catggtgaaa 60 
gattggcagc tggaactccc caagctctta atatctgtgc atggaggcct ccagaacttt 120 
gagatgcagc ccaagctgaa acaagtcttt gggaaaggcc tgatcaaggc tgctatgacc 180 
accggggcct ggatcttcac cgggggtgtc agcacaggtg ttatcagcca cgtaggggat 240 
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gccttgaaag accactcctc caagtccaga ggccgggttt gtgctatagg aattgctcca 300 
tggggcatcg tggagaataa ggaagacctg gttggaaagg atgtaacaag agtgtaccag 360 
accatgtcca accctctaag taagctctct gtgctcaaca actcccacac ccacttcatc 420 
ctggctgaca atggcaccct gggcaagtat ggcgccgagg tgaaacttcg aagacaactg 480 
gaaaagcata tttcactcca gaagataaac acaagaatcg gtcaaggtgt tcctgtggtg 540 
gcactcatag tggaaggagg acccaatgtg atctcgattg ttttggagta ccttcgagac 600 
acccctcccg tgccagtggt tgtctgtgat gggagtggac gggcatcgga catcctggcc 660 
tttgggcata aatactcaga agaaggcgga ctgataaatg aatctttgag ggaccagctg 720 
ttggtgacta tacagaagac tttcacatac actcgaaccc aagctcagca tctgttcatc 780 
atcctcatgg agtgcatgaa gaagaaggaa ttgattacgg tatttcggat gggatcagaa 840 
ggacaccagg acattgattt ggctatcctg acagctttac tcaaaggagc caatgcctcg 900 
gccccagacc aactgagctt agctttagcc tggaacagag tcgacatcgc tcgcagccag 960 
atctttattt acgggcaaca gtggccggtg ggatctctgg agcaagccat gttggatgcc 1020 
ttagttctgg acagagtgga ttttgtgaaa ttactcatag agaatggagt aagcatgcac 1080 
cgttttctca ccatctccag actagaggaa ttgtacaata cgagacatgg gccctcaaat 1140 
acattgtacc acttggtcag ggatgtcaaa aaggggaacc tgcccccaga ctacagaatc 1200 
agcctgattg acatcggcct ggtgatcgag tacctgatgg gcggggctta tcgctgcaac 1260 
tacacgcgca agcgcttccg gaccctctac cacaacctct tcggccccaa gagggatgat 1320 
attcccttga ggcgaggaag aaagacaacc aagaaacgtg aagaagaggt ggacattgac 1380 
ttggatgatc ctgagatcaa ccacttcccc ttccctttcc atgagctcat ggtgtgggct 1440 
gttctcatga agcggcagaa gatggccctg ttcttctggc agcacggtga ggaggccatg 1500 
gccaaggccc tggtggcctg caagctctgc aaagccatgg ctcatgaggc ctctgagaac 1560 
gacatggttg acgacatttc ccaggagctg aatcacaatt ccagagactt tggccagctg 1620 
gctgtggagc tcctggacca gtcctacaag caggacgaac agctggccat gaaactgctg 1680 
acgtatgagc tgaagaactg gagcaacgcc acgtgcctgc agcttgccgt ggctgccaaa 1740 
caccgcgact tcatcgcgca cacgtgcagc cagatgctgc tcaccgacat gtggatgggc 1800 
cggctccgca tgcgcaagaa ctcaggcctc aaggtaattc tgggaattct tctacccccc 1860 
accatcttgt ttttggaatt tcgcacatat gatgatttct cgtatcaaac atccaaggaa 1920 
aatgaggatg gcaaagaaaa agaagaggaa aatacggatg caaatgcaga tgctggctca 1980 
agaaaggggg atgaggagaa cgagcacaaa aaacagagaa gtattcccat cggaacaaag 2040 
atctgtgaat tctataacgc gcccattgtc aagttctggt tctttcagat atcatacttg 2100 
ggctacctgc tgctgtttaa ctacgtcatc ctggtgcgga tggatggctg gccgtccctc 2160 
caggagtgga tcgtcatctc ctacatcgtg agcctggcgt tagagaagat acgagaggtg 2220 
gctactccaa aggcctcccc cagcccacta gccagaaaat caaagtttgg cttccaggag 2280 
tactggaaca tcacagatct cgtggccatt tccacattca tgattggagc aattcttcgc 2340 
ctacagaacc agccctacat gggctatggc cgggtgatct actgtgtgga tatcatcttc 2400 
tggtacatcc gtgtcctgga catctttggt gtcaacaagt atctggggcc atacgtgatg 2460 
atgattggaa agatggttag tagtggcatc ctgtgggtag tcatcatgct ggtcgtgctc 2520 
atgagtttcg gagtagcccg tcaagccatt ctgcatccag aggagaagcc ctcttggaaa 2580 
ctggcccgaa acatcttcta catgccctac tggatgatct atggagaggt gtttgcagac 2640 
cagatagacc gtaagagttt ttttctttca gctccttgtg gtgagaacct atatgatgag 2700 
gagggcaagc ggcttcctcc ctgtatcccc ggcgcctggc tcactccagc actcatggcg 2760 
tgctatctac tggtcgccaa catcctgctg gtgaacctgc tgattgctgt gttcagcaat 2820 
accttctttg aagtaaaatc aatatccaac caggtgtgga agttccagcg atatcagctg 2880 
attatgacat ttcatgacag gccagtcctg cccccaccga tgatcatttt aagccacatc 2940 
tacatcatca ttatgcgtct cagcggccgc tgcaggaaaa agagagaagg ggaccaagag 3000 
gaacgggatc gtggattgag tatgttcctt agcgacgagg agctaaagag gctgcatgag 3 060 
ttcgaggagc agtgcgtgca ggagcacttc cgggagaagg aggatgagca gcagtcgtcc 3120 
agcgacgagc gcatccgggt cacttctgaa agggttgaaa atatgtcaat gaggttggaa 3180 
gaaatcaatg aaagagaaac ttttatgaaa acttccctgc agactgttga ccttcgactt 3240 
gctcagctag aagaattatc taacagaatg gtgaatgctc ttgaaaatct tgcgggaatc 3300 
gacaggtctg acctgatcca ggcacggtcc cgggcttctt ctgaatgtga ggcaacgtat 3360 
cttctccggc aaagcagcat caatagcgct gatggctaca gcttgtatcg atatcatttt 3420 
aacggagaag agttattatt tgaggataca tctctctcca cgtcaccagg gacaggagtc 3480 
aggaaaaaaa cctgttcctt ccgtataaag gaagagaagg acgtgaaaac gcacctagtc 3540 
ccagaatgtc agaacagtct tcacctttca ctgggcacaa gcacatcagc aaccccagat 3600 
ggcagtcacc ttgcagtaga tgacttaaag aacgctgaag agtcaaaatt aggtccagat 3660 
attgggattt caaaggaaga tgatgaaaga cagacagact ctaaaaaaga agaaactatt 3720 
tccccaagtt taaataaaac agatgtgata catggacagg acaaatcaga tgttcaaaac 3780 
actcagctaa cagtggaaac gacaaatata gaaggcacta tttcctatcc cctggaagaa 3840 
accaaaatta cacgctattt ccccgatgaa acgatcaatg cttgtaaaac aatgaagtcc 3900 
agaagcttcg tctattcccg gggaagaaag ctggtcggtg gggttaacca ggatgtagag 3960 
tacagttcaa tcacggacca gcaattgacg acggaatggc aatgccaagt tcaaaagatc 4020 
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acgcgctctc atagcacaga tattccttac attgtgtcgg aagctgcagt gcaagctgag 4080 
cataaagagc agtttgcaga tatgcaagat gaacaccatg tcgctgaagc aattcctcga 4140 
atccctcgct tgtccctaac cattactgac agaaatggga tggaaaactt actgtctgtg 4200 
aagccagatc aaactttggg attcccatct ctcaggtcaa aaagtttaca tggacatcct 4260 
aggaatgtga aatccattca gggaaagtta gacagatctg gacatgccag tagtgtaagc 4320 
agcttagtaa ttgtgtctgg aatgacagca gaagaaaaaa aggttaagaa agagaaagct 4380 
tccacagaaa ctgaatgcta gtct 4404 

<210> 43 

<211> 669 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7481427CB1 

<400> 43 

atgacggaac aagccatctc 
atctccaaga cagccgtggc 
gccagcatgc cgatggccgc 
atccccaaag atcagggcgt 
tactccccca cgcaagccct 
gcgggcgtgg acaagcacac 
ggcacggccg tcgtgtatcc 
aagtcaggca cggagcgcga 
tccgacggca tccggggcct 
taccaggcag cctacttcag 
aacacacaca tcctggtgag 
ctctcctaa 

<210> 44 
<211> 1823 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 7483595CB1 

<400> 44 

aaaaaaaaac aaataaaaaa tgcccagtgt gagcatctgt agtcccagct atttgggaag 60 

ttgaagcaga gaatcccttg tgcccaggag tttgaggtta cagtgagcta tgattgcacc 120 

actgcactcc ggcctgggca acagagtgag atcctgtctc ataaataaat aaataggcaa 180 

aggcttgttc tgatcacctt gtccctgttt tccccaagtc aacctagaga gcagtagatt 240 

ctagggcaag aacttaagtg agctgactgc tcacaaagaa ggaaaggagt agtgtgagct 300 

agcacttgtt gggagcctct gatcaacaat acaaggaccc gtgttatcaa cttctgttca 360 

ttagggaaag agagttacaa gagttaccaa aggggcctat cttaactttt ctatggaaac 420 

ccttcttttc ctggagagtg ccattggtgc cattgctgga ttgaagacat tctaccccac 480 

tctggttttc tcttctctgc agctaataat gggcgtattg ggtttgggct tcattgccac 540 

ttaccttccg gagtctgcaa tgagtgctta cctggctgct gtggcacttc atatcatgct 600 

gtcccagctg actttcatct ttgggattat gattagtttc catgccggtc ccatctcctt 660 

cttctatgac ataattaatt actgtgtagc tctcccaaaa gcgaattcca ccagcattct 720 

agtatttcta actgttgttg ttgctctgcg aatcaacaaa tgtatcagaa tttctttcaa 780 

tcagtatccc attgagtttc ccatggaatt atttctgata attttacaag ccttctcctt 840 

atctttggtg agctcctttc tgctcatatt tctgggcaag aagattgcca gtcttcacaa 900 

ttacagtgtc aattccaacc aggatttaat agccatcggc ctttgcaatg tcgtcagttc 960 

atttttcaga tcttgtgtgt ttactggtgc tattgctagg actattatcc aggataaatc 1020 

tggaggaagc acaacagttt gcatctctgg taggcgcaga gctaagattc ttctcctggg 1080 

tcaaatccct aacaccaaca tttatagaag catcaatgat tatcgggaga tcatcaccat 1140 

tcctggggtg aaaatcttcc agtgctgcag ctcaattaca tttgtaaatg tttactacct 1200 

aaagcataag ctgttaaaag aggttgatat ggtaaaggtg cctcttaaag aagaagaaat 1260 

tttcagcttg tttaattcaa gtgacaccaa tctacaagga ggaaagattt gcaggtgttt 1320 



cttcgccaag gacttcctag. 
atcgatcaag cgggtccagc 
cgccaagcag tgcaagggca 
gctgtccttc tggaggggca 
caatttcgcc ttcaaggata 
gcagttctgc aggtactttg 
cctggatttc accagaaccc 
gttccgaggc ctgggagact 
ttaccagggc ttcagcgtct 
ggtgtacgat acggccaatg 
ctggatgacc gcgcagaccg 



ctggaggcat caccgccgct 60 
tgctgctgca gatgcagcac 120 
tcgtggactg catcgtccgc 180 
acctggccaa tgtcatccgc 240 
agtacaagca gatcttcctg 300 
cgggcaacct ggcctctggc 360 
gcctggcagc cgacgtggga 420 
gcctagtgaa gatcagcaag 480 
ccgtgcaggc catcatcatc 540 
gcatgttccc cgaccccaag 600 
tgacggccgt ggctggcgtg 660 

669 
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ctgcaactgt gatgatctgg agccgctgcc caggattctt tacacagagc gatttgaaaa 1380 

taaactggat cccgaagcat cctccgttaa cctgattcac tgctcacatt ttgagagcat 1440 

gaacacaagc caaactgcat ccgaagacca agtgccatac acagtatcgt ccgtgtctca 1500 

gaaaaatcaa gggcaacagt atgaggaggt ggaggaagtt tggcttccta ataactcatc 1560 

aagaaacagc tcaccaggac tgcctgatgt ggcggaaagc caggggagga gatcactcat 1620 

cccttactca gatgcgtctc tactgcccag tgtccacacc atcatcctgg atttctccat 1680 

ggtacactac gtggattcac gggggttagt cgtattaaga caggtaagta ctgaggaggc 1740 

cttggcagga gctctgatcc ccctcctacc atcccaaccc caccctgatc ctgattgaat 1800 

tctgttagtc actgatcagc tgc 1823 

<210> 45 

<211> 2931 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: 3788427CB1 

<400> 45 

atgtccatag ttcgtttatc cgtccatgcc aaatggatta tggggaaggt gactgggaca 60 
aaaatgcaaa agactgctaa agtgagagtg atcaggcttg ttctggatcc ccatttatta 120 
aagtattata ataagcaaaa aacgtacttt gctcacaatg cccttcagca gtgcacaatt 180 
ggggatatcg tgcttctcaa agctttacct gttccacgaa caaagcatgt gaaacatgaa 240 
ctggctgaga ttgttttcaa agttggaaaa ctcgtagatc ctgtgacagg aaaacctcgt 300 
gcaggaacta cctatctgga gagtccattg agttcggaaa ccacccaggg cgtagacggg 360 
gcttcccgtc cctcacgcgg gcccgcccca tgccgggcgg ggcctggggc ccgccgcctc 420 
cgcccgtggc cggagagccc caggcccgag ccacgcggtc ttccgggccc gggtcggggc 480 
tcaatggcta cgtggaggcg ggacggccga ctgacaggcg gccaaaggct gctgtgcgct 540 
gggctggcgg ggacgctcag cctcagcctc accgcgcccc tggagctcgc caccgtgctg 600 
gcccaggttg gcgtcgtgcg aggccacgcc cggggaccgt gggccacagg gcaccgggtg 660 
tggcgggcag aggggctccg ggccctgtgg aaggggaacg cggtggcgtg cctgcgcctc 720 
ttcccctgca gcgccgtgca gctcgccgcc taccgcaaat ttgttgtgct gttcacagat 780 
gacctgggcc acatttccca gtggagctcc atcatggctg ggagtctcgc aggcatggtt 840 
tccaccattg taacatatcc tacagacctc atcaaaaccc ggttgatcat gcagaacata 900 
ctggaaccat cgtacagggg gctcctccat gctttttcta ctatttacca acaggaaggg 960 
ttccttgccc tttatcgagg ggtttccctc actgttgtag gtgctctccc gttctctgct 1020 
ggctcccttc ttgtttacat gaacctggag aaaatctgga acggaccccg agatcagttc 1080 
tctctcccac agaactttgc taatgtctgt ctggctgctg cagtgaccca gaccctctcc 1140 
tttccctttg agaccgtgaa gagaaagatg caggctcaga gcccctacct cccacacagt 1200 
ggaggagtag atgtccattt ctcaggagca gtggactgct tccggcagat agtgaaggcc 1260 
cagggggtcc tggggctctg gaatggattg acagccaatt tactgaagat agttccatat 1320 
tttggaatta tgtttagcac ctttgagttc tgcaagagaa tctgtcttta tcaaaatggt 1380 
tacattctgt ctccactgag ctataaattg accccaggag tcgatcagag tttgcagccc 1440 
caggaattac gagaattaaa gaagttcttc aaaacgagaa aactgaagcc taaaaaacca 1500 
actctataaa atggaatgga actagaagtg cactgactga cagcatgttg caatcacaga 1560 
taaatgtagc ctcataactg tgcacctgag gagggatgag aagctactgc tatctgctgc 1620 
tctgggaaat gagatggttt ggccatgcac cacctcagat ctccaaactg atgtccctgt 1680 
gaacacctgc actatgagaa ggtttcccaa caagagtgta tcatgatggc attccacgag 1740 
attttataac cagagtctag cagtgataaa atgtacaatt tacgctctga atagaaacat 1800 
ggcaccaaag aaagggcatt gccatgacat ctcaaagcaa caggtaattg aagtggctga 1860 
gtggatacca cgttgtgagg tcagccactg ggtcacttgt agtcaatgca aagcttgcac 1920 
aaaccaacgt cagcctagtt tcactcagtt tgacctgaga ctttccaata caatctgttg 1980 
ttttgttttg tttttagaga cagggtcttg ctctgtcacc cagactggag tacaatgaca 2040 
caatcatagc tcactgcagc ctttaactac tgggctcaag acatcctcct gcttcagcct 2100 
ccagagagtt gggaccatag gtgcacacca ccacacctag ctaatttttg ggggaggtct 2160 
tgctatgttg cccaggctgg tcttgaactc ctgggctcat gcaatcctcc tgccttggcc 2220 
tcccaaagcg ctaggattag aggtgtgagc cgctgcaccc tgccccagta caatcttttt 2280 
tgaactcaaa tttttgctga catctgagtg cacacaccac agtgtaaatt atgccttatc 2340 
agaatctaaa tgaaaatagc gaacatttaa aagctatcac cattgtagta gaatcatcct 2400 
tcttttttga aatttgaagc atcccaggct taaaatcttg tgtttcagaa agacagttta 2460 
taccatgact gcttaattat ccccccaaag accttctgat tgaagtcatg tacagttcag 2520 
tggcctaaat tctctgcctt tttaacttgc tttgcaagcc tactctgaaa ataagttatt 2580 
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tagtcaagtt attctcaaag atgtcccagt tgcctagaaa ggatcaaatg gaacatttga 2640 

cacacatact caaaaaaatg taactgacta taaacacttt aacctaatca tctgtatcaa 2700 

actttctaaa aatcaaatct caggattgtt ccactttaga gattctatgt aaagtttata 2760 

taactatact tgtcaaatag cacctatcta tgcatttaaa aatgcattaa tatctaacat 2820 

aaaattgtta atttatacac acacatacac ttatacatag agagagattt tcggtatgtt 2880 

gcaagtattt tttccgatta aatttatttt ccaaagaaaa aaaaaaaaaa a 2931 

<210> 46 

<211> 1492 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 6972455CB1 

<400> 46 

cttgctcctt gatcagtttg cggatggcct gcaaggccgc ttccggtacg gcttgggcaa 60 
tggaattcat aaaaagctca gggttcccga tagcgtgtgt gacccgttgt aaaactattc 120 
atagctggct gcaagccacg ccccaaaggc gctcgcggct ttttgcaaac gctcgaaaag 180 
ggtgcgggaa acctgaccac tgttcagacc agtaagacca ctgcacctca acaaacatca 240 
gccttgcaga cgatccgaac ggtgctggca tgggctgtgc actgggcaaa ctcagaaaat 300 
tccttcgcct ttttcggaga gctgccatga ccaagcgcta cagcgccctg ctcactgccc 360 
tgtttgccag cctgatgctg agccaggcac ccgcccaggc gagtggtctg gatgacatcg 420 
tcgcccgtgg caccctgaag gttgccgtcc cccaggactt cccgccgttt ggttcggtcg 480 
gccctgacat gcaaccccgg ggcctggaca tcgacaccgc caagctgctg gccgaccaac 540 
tcaaggtcaa gctggaactg acgccggtca acagcaccaa ccgcataccg ttcctcacca 600 
ctggcaaagt cgacctggtt atttccagcc tgggcaagaa ccccgagcgc gcaaaggtca 660 
tcgacttctc caacgcctac gcgccgttct accttgccgt gtttgggccg cctgatgccg 720 
ccatcgccag cctcgacgac ctcaagggca agaccatcag cgtgacccgt ggcgccatcg 780 
aagacatcga gctgaccgcg gtagcgccca aggaagccac gatcaagcgt tttgaagaca 840 
acaactccac catcgccgcc tacctggccg gccaggtcga cctgatcgcc agtggcaacg 900 
tggtcatggt ggccatcagc gaacgcaacc ccaagcgcgt accggcactg aaggtgaaac 960 
tcaaggattc gccggtgtac gtaggcgtga acaagaacga gccggcgctg ctggaaaagg 1020 
tcaaccagat cctggtggca gccaaggctg acggcagcct tgggaaaaac gcgatgcagt 1080 
ggctcaaaga accgctgcct gccgacctct gacgcggagc tcaaagatgg cctatcaatt 1140 
cgactttgtg ccggtgctgg ccaataccga cctgctcctg cgtggcgcgc tgttcaccct 1200 
tgagctgacg gccattggcg ccatcctcgg cgtggccctg ggcactgtcg gcgccgtggt 1260 
acgggcgtgg aagatccaac cgtttgcgtg ggtctttggt gtgtatgtcg agttgatccg 1320 
caacacgccg ttcctggtgc aactgttctt catcttcttc ggcctgccat ccctcgggtt 1380 
gaaaattacc gagtggcaag ccgccgtgct ggcgatggtg atcaacctgg gggcctactc 1440 
caccgagatc atccgtgccg gcatccaggc catcccacgg gggcaactgg ag 1492 

<210> 47 

<211> 2406 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 8077668CB1 

<400> 47 

atgtatgacc caccaaaagt gaataagaaa aatgactctg ctgatcattc tgatgaatct 60 
gtggaaatac agtcagccct tcatatctgt ggatttgacc aaggcaggtg gatcctctcc 120 
taccgccggt cacccgcgcg ccggcacacg ggcctggccc cagtacttgg ggtgcgggtc 180 
cgcgtctgcg tggggtgcca aggggcccgg gtcccagcgt ggcccaaacg tgagggtagg 240 
ggctccgcgg aggagccagt agagagaaag gatggacaac aaggggagag gagaggagaa 300 
gccagcctga ccaaggatgc tgttaccttc cagagcagtc tccctgtcca gatcaccagg 360 
atccctgctt gctggagaat gacccagatg tcccaggtgc aggagctctt ccatgaggca 420 
gcccagcagg atgccctggc ccagccccag ccctggtgga agacccagct gttcatgtgg 480 
gagcctgtgc tgtttgggac ctgggatggt gtgttcacat cctgcatgat caacatcttt 540 
ggggttgtgc tcttcctgag gactggctgg ctggtgggaa acacaggagt gctcctgggc 600 
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atgttcctgg tgtccttcgt catcctggtg 
gtcggggagc gcagcagcat cggcagcggt 
ggtgggcaga cgggaggcac catcgggctg 
ggctttgctg aatccatctc ggatttgctg 
atttcagttg cggtgcttct ggccttgctg 
atccgcctcc agctgctgtt gctgttcctg 
ggttctttca cccacctgga cccagaacat 
cagaacaaca cgctgcccga ttacagcccg 
ttcttcccag cggctacagg agtcatggcc 
cctgccgcca gcattcccct gggctccctg 
atcggctaca ggagtcatgg ccggcttcaa 
cagcattccc ctgggctccc tggcagctgt 
cagtctgagc tagagagtgg gagagaaaac 
tctgagagtg ccccgactgt gaccaatcta 
ggtgggctgg aaaattcacg tctggtgggt 
gggaatgctg ccgaggagga gtgtcgggac 
ttcctcggcc tcccgccccc gtccctcccc 
aaccaccgag tttccaggat cactggaggg 
cgaattgggg ctgcatctca gctctgccgc 
tgagttctct gagcctccat ttcctttctt 
ttcctagatt gcggcgggga gagaaggcac 
cttcccccag tgagcctccc ctccagctcc 
tcacactgtg ccacccacac aagcaggctg 
gaatattgtt tctctctgta ccacaggaag 
gaagaggcaa agagatcaga gaccagaaac 
tattctaacc acagtattta gaaaatatta 
tcttagctta ctgcagtgaa accattgcta 
gtgcaataac attgggaaga aaatgaagac 
aataaaagaa agaggcttct gaatgaggac 
ttccaggagg aacccttggc ccatcctctg 
gaacct 

<210> 48 

<211> 3686 

<212> DNA 

<213> Homo sapiens 



gccctcgtca cggtgctgtc tgacattggc 660 
ggcgtctact ccatgatctc ctcggtcctg 720 
ctctatgtgt ttggacagat gtatatcacc 780 
ggcctcggga atatctgggc tgtgcgagga 840 
ggcattaacc tcgcaggtgt caaatggata 900 
ctggccgtgt ccacactgga ctttgtggtg 960 
ggtttcattg gatattcacc cgaactgcta 1020 
ggggaatctt ttttcactgt ctttggggtt 1080 
ggcttcaaca tggggggcga cctcagggag 1140 
gcagctgttg gcatctcgtg gtttctgtac 1200 
catggggggc gacctcaggg agcctgccgc 1260 
tggcatctcg taagttcccc tcatggcaat 1320 
cccaggcaga agagggattg ccccttttgt 1380 
tttcatattc cgatccttct tttaacactc 1440 
tcagacactg ccctttcatg cgccctgcct 1500 
ccagatcaca gatcgaatcc tttctttcct 1560 
cacccccatt ttgaacttgg ggattttttt 1620 
agcataaaga tgaagagttt tgcaatcaga 16 80 
tgattggctg catgactctg gacaatttac 1740 
gtgaaatggg gataatgaca ctgtatgtgc 1800 
acagtggaag ctcagcaccc agctcttctt 1860 
cccaggccgt caggcctctg gggttttagt 1920 
ggtaaccttg ggtctatttc tgacctttta 1980 
aagcttctct gcccttcatt gggtaggtgt 2040 
aatgacttga gaagttaaaa gggcctgtgt 2100 
atgtatctgt ttcacaaaca ctgaatttgt 2160 
tcagcctggc gtacagcatc agggtatttt 2220 
aacagagata gagttgagtg ttgggaagga 2280 
cctgtcttgt ttctggcccc agctgggagg 2340 
attacccaga tgggaatgag gagaccacag 2400 

2406 



<220> 

<221> misc_feature 

<223> Incyte ID No: 55120485CB1 



<400> 48 

atggaagaga acagtaagaa ggaccatcgg 
ctggaggtgt ttggttaccg ggaccacaat 
gtgctgacct gtgggggcct tctgctggtg 
gccaactgca tcccatgccc cttgcaagaa 
gaatttcaaa gatatatgag gaagaaggta 
cctgtaagca agaagtggga agaatccctg 
gccttaataa agccagaatt aaaactgcgg 
tggaacgacc tggagaagcg gtttcagaaa 
tctgacatcc atcagacatt tggattgggt 
ttagtgtgtg ggcccaacgc cattgaggtt 
aaacaggttt taaatccatt ctatgtgttc 
caaggttaca tagaatactc tgtggccatc 
agtgtgtatg atttgcgaca gcaatcagtt 
aaagtccagg ttacaatcat tgtaaaagac 
ttggttcccg gagacattct tattcttcca 
ttgattgatg gaagctgcgt ggtgaatgaa 
acaaagacac cattgcccca gatggagaac 
gattatagga aacacgtcct tttctgtgga 
caggggcctg tacgagcagt cgttttgcaa 
gtgagatcca tcctgtaccc ccggcctctg 
ttcatcgtgt tcctggcctg ccttggtgtc 



gctttgctca accagggaga ggaggatgaa 60 
gtacggaaag ccttctgcct tgtcgcatcc 120 
ttctactgga gaccccagtg gagagtgtgg 180 
gcagacactg ttttgctgag gacaacagac 240 
ttctgcctct acttatacac actgaagttt 300 
gtggctgacc gccactctgt cataaaccaa 360 
tgcctggaag tgcagaaaat caggtatgtt 420 
gttgggttgc tagaagacag caattcctgc 480 
ctgaccagtg aagagcaaga ggtcagaaga 540 
gaaatccaac ccatatggaa gctgcttgtt 600 
caagccttca ccctaacttt gtggctgtct 660 
atcattttga ctgttatctc cattgtctta 720 
aagctgcata acctcgtgga ggaccacaac 780 
aaaggtttgg aggagctgga atcccgtctc 840 
ggaaaatttt cattgccatg tgatgctgtt 900 
ggcatgctta caggagaaag tatacctgtt 960 
actatgcctt ggaaatgtca cagtttggag 1020 
acagaagtta tccaggtcaa gccctctggg 1080 
acaggttaca atacagccaa aggggactta 1140 
aacttcaaac tatacagcga tgccttcaag 1200 
atgggttttt tctatgccct aggggtatat 1260 



68/83 



atgtaccatg gagttcctcc aaaagatact gtgaccatgg ccctgatcct cctcaccgtg 1320 
actgtccctc cagtgctgcc agctgccctg accataggca acgtgtatgc tcagaagaga 1380 
ctgaagaaaa agaaaatctt ctgtatctcc ccacagagaa tcaacatgtg tgggcaaata 1440 
aacctcgtgt gctttgacaa aactggcact ctcactgaag atgggctgga cctctggggg 1500 
actgtcccta ctgctgacaa ctgcttccag gaagcccaca gctttgcctc aggccaggct 1560 
gtgccatgga gcccactgtg tgcggccatg gccagctgcc actctctgat ccttctcaat 1620 
gggaccatcc agggagaccc tctggacctc aaaatgtttg agggcactgc ctggaaaatg 1680 
gaagattgca ttgtagactc ctgcaaattt gggacgtcag tttcaaacat cataaaacca 1740 
ggaccaaaag ccagtaagag tccagtggaa gccatcatca ccttgtgcca gtttccattt 1800 
tcctcgagcc tgcagaggat gtccgtgatc gctcagctag ctggggagaa tcatttccat 1860 
gtctacatga aaggtgcccc agaaatggtg gecaggttet gcagatctga aacagtgccc 1920 
aagaatttcc cacaggaact gaggagttac acggtgcaag gcttccgtgt cattgetett 1980 
gcccacaaaa ccttaaagat ggggaatctt tcagaagtcg agcacttagc cagagaaaaa 2040 
gtggagtcag agttaacatt tctgggactt ctcatcatgg agaatcgett gaaaaaagaa 2100 
accaaactgg tcttgaagga actgagtgag gcccgtatca ggactgtgat gattacaggt 2160 
gataaccttc aaaeggecat tactgttgca aagaattctg aaatgatccc tccaggcagc 2220 
caagtgatca ttgttgaggc cgatgaacca gaagaatttg ttcctgcctc tgtgacctgg 2280 
cagctggtgg agaaccaaga gactggacct gggaagaaag aaatctacat gcatactgga 2340 
aacagttcaa cccctcgtgg ggaaggagga agctgttacc attttgeaat gagtgggaaa 2400 
tcataccaag tgatatttca gcatttcaac agettgette caaaaattct ggtgaatgga 2460 
acagtttttg caagaatgtc tcctgggcag aaatcaagee ttattgaaga atttcagaaa 2520 
ttaaatgect gcacagtaca aaatgaaagc atctcagagt taaccatgtc tccaactgct 2580 
ccagaaaaaa tggaaagtaa tagcaccttc acaagttttg agaacactac agtctggttc 2640 
ttgggaacaa tcaactgtat cactgtggct cttgtgttct ctaaaggaaa accatttaga 2700 
cagccaactt atacaaacta tatatttgtc cttgtgctga taatacagct tggtgtatgt 2760 
ctattcattc tatttgetga tataccagaa ttatatagac gtttggatct gctctgcact 2820 
cccgtcctgt ggagggcetc cattgtcatc atgetcaget tgaatttcat tgtgtccctt 2880 
gtggccgagg aggctgttat tgaaaatcga gccctgtgga tgatgattaa aagatgtttc 2940 
ggctatcagt caaaaageca gtateggata tggcagaggg acttggcaaa tgaccctagt 3000 
tggcccccgc taaaccaaac ctcccactct gaeatgeegg agtgtggcag aggagtgtct 3060 
tacagcaatc cagtatttga gagcaatgaa gaacaactct gaaggacatg tgatatccaa 3120 
gttgatatta cagagaaaca atgacaaaag ggaccagttt cattgatttt aagaaatgag 3180 
actcttgtaa catcagctgg agttttgggg ctacctatca aatcatggtg acaaactaaa 3240 
gtctgtttga taaaatactg tctacagaga agaacattga cccagaagag tgctttcctc 3300 
ttggcttctg aaggtattag aggcattcag tgaaggcaag aatgggccaa gaatttctat 3360 
gggecaagag acaagcaggc taaaggggaa aaatatgtga tctttcctct taggacagga 3420 
caatactttc ctttttataa aaatgagttg aggggaagta gtactgtgta tactcaegga 3480 
gttgtaattt ttgaattttt acttcatcga tggaaaaact aatgcgtggg aaacattaca 3540 
ccaatcttta ctgtctgggt tccttttctc tgtagtactt ttcagatttg aggctttttc 3600 
ttctgtttct ggccttaccc aacatattgt taaggataag ettagectat getgeaataa 3660 
aaaacaactc tcaaaaaaaa aaaaaa 3686 

<210> 49 

<211> 2807 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3112883CB1 

<400> 49 

aagaattegg cacgaggcca cagccgcggg tgcgtggtcg tgcggcaagt gagegatttg 60 
gtgaacagac actcccttct gtagccatgg attgttacag aacttcacta agcagttcct 120 
ggatttaccc cactgtgatc etctgettat ttggtttttt ctccatgatg agaccctcag 180 
aaccattcct tatcccatat ttatctggac cagataaaaa cctgaccagt gcagagataa 240 
caaatgagat cttccccgtt tggacatact cctacctggt getgetgetg cctgtgtttg 300 
tcctcaccga ttatgtccgc tacaagecag tcatcatctt gcaaggtatc agtttcatca 360 
ttacctggct gctgctgttg tttggccaag gagtgaagac catgeaggtt gtagagttct 420 
tctatgggat ggtcaccgcc gccgaggtgg cctactacgc ctacatatac agcgtggtca 480 
gccccgagca ctaccagaga gtgagegget actgeaggag cgtcacgctg gccgcctaca 540 
cagcagggtc ggtgctggct caactcttgg tatccctggc gaacatgtcg tacttttacc 600 
teaaegtcat atccttggcc tctgtctccg tggctttcct tttctcactt ttcctaccaa 660 
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tgcccaagaa aagcatgttt tttcatgcaa aacccagcag agaaataaag aagtcatcaa 720 
gcgtgaatcc agtattagag gaaactcacg aaggtgaagc accaggctgt gaagagcaga 780 
aacccacatc agaaatactc agcacttcag ggaagctgaa taagggccag ctgaacagcc 840 
tgaaaccaag caatgtgact gtggacgttt ttgtgcagtg gttccaagat ttgaaggagt 900 
gctactcctc aaaacgtctt ttctactggt ctctatggtg ggctttcgcc acagcaggtt 960 
ttaaccaggt tttgaactat gttcaaatcc tgtgggatta caaggcgcca tcccaagatt 1020 
cttccatcta taatggggcc gtagaagcta ttgcaacctt tggaggggct gtggctgcct 1080 
ttgcagtggg ttatgtgaaa gtcaactggg accttctggg agagctggct ctggtggtct 1140 
tctcagttgt caatgccggt tctttatttc tcatgcatta cacagccaat atctgggcgt 1200 
gctatgctgg ctatttgata ttcaagtcca gctatatgct tcttataacc atagcagtat 1260 
ttcagattgc agttaatctg aatgtggaac gctatgcctt ggtatttgga atcaacacct 1320 
ttattgcctt ggtgattcag accatcatga ctgtgattgt agtagatcag agagggctca 1380 
acttgccagt cagcattcag tttttagttt atgggagcta ttttgcagta attgctggaa 1440 
ttttcctaat gagaagcatg tatattacct actcaaccaa atcccagaag gatgtacaga 1500 
gccctgctcc aagtgagaat ccagatgtgt ctcacccaga ggaagagagt aatatcatca 1560 
tgtcaacaaa actctaacct catcgcaaca aacgcaacag tggctttcaa agttatgcaa 1620 
taataaggaa agattttgag atgggtggca tatgttttgc cataacttga catgctttgc 1680 
aaatctggat tccaatggac ctttcaaaac cacaacaaaa cctcagtttt agatgagttc 1740 
tctatgtgac caattttact ggatgcaatt gacaggaccc gtcatcataa ttaaacaacc 1800 
catattgggg accccctgtg actagtagca gctggaaaat tctggttttt atcacttgta 1860 
aagacatgca gatggcgtgg aaccaaacca tgagaaaact ccagccatcc tggagttgat 1920 
attcaccatt tgtgaggaga aatactaact ggactgaccc tattgctagg cttaaatact 1980 
tatttgatct taccaaagaa gtcaacacat gggacctttg tgtcacatga accattttct 2040 
ttcctcttct attaagtgta tttctgttta agttacagtt ctctaagaga attacaatgt 2100 
ttgtcccatt tctaagggct tctcttcaac tctaataaca gcattattca cgttatgatt 2160 
tgatagtatt attatttaat ttttttatga ttattttcca ttttgtgctc tgagttttgc 2220 
tgttgaaagt ctccctcaag aatagcttca gatcctcttg tgtatttgca gaatacacaa 2280 
ggtcatttcc cagtggcctg ggagaggcag tgagccttct ctccaccacc atagacaggt 2340 
gttaatgcat ctatgggcca ggtgcagtgg ctcactcctg taatccgagc aatttgggag 2400 
gccaaggtgg gagttttgct tgaggctggt ctcaaactcc tgacttcagg tgatctgtct 2460 
gcctcggcct cccaaagagc tgagattaca ggcgtgagcc accatgcctg gccaaaacac 2520 
ctgattttta aatggtagca attgtgctaa ataacatgtg tcaacaattt tcaaagagtt 2580 
gttaacattt tttgatgaat tctgctgaaa attatctgaa catacctgtt tttcgaaact 2640 
tcaagaaact ttcaataagg gatgcattga gccgtctgac agtgagaata acgatgctga 2700 
gtcagatttt attaggctgg acttgtggtt acaactttat cccgcatttt gggaagctaa 2760 
gtgagaggta attgaggttg gggtttaaac ccttggcatt ttaacct 2807 

<210> 50 
<211> 2170 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 4253888CB1 

<400> 50 

gtgaatgccc cttccatccc ccgcccctgg tgtctcggtc tggtctgagg agacggggac 60 
ccttctcaca ccggccgctg ccgccgccgc gctccggaac agatccagtc cttctgtgga 120 
acttctgaac atcttttatt agtggaaata ttttctacac aatgaagtca acaacttaat 180 
ttaaaccagt gtttgtgcgg ttctgattca tctgctgtgg ttcccgaagc ttgagatcta 240 
aggagtacag ggtcttttgt gatgacaata tgactaatag taaaggaaga tctattaccg 300 
ataaaacaag tggtggtcca agtagtggag gaggttttgt agattggact ttacgtttaa 360 
acacaattca atccgacaag tttttaaatt tactcttgag tatggttcca gtgatttacc 420 
agaaaaacca agaagacagg cacaaaaaac caaacggcat ttggcaagat ggattatcaa 480 
ctgcagtaca gacttttagt aatagatctg agcaacacat ggagtatcac agtttctcag 540 
agcagtcttt tcatgccaat aatgggcacg catcatcaag ctgcagccaa aagtatgatg 600 
actatgccaa ttataattac tgtgatggaa gggagacttc agaaaccact gccatgttac 660 
aagatgaaga tatatctagt gatggtgatg aagatgctat tgtagaagtg accccaaaat 720 
taccaaagga atccagtggc atcatggcat tgcaaatact tgtgcccttt ttgctagctg 780 
gttttggaac agtttcagct ggcatggtac tggatatagt acagcactgg gaggtgttca 840 
gaaaagttac agaagttttc attttagtcc ctgcacttct tggtctcaaa gggaacttgg 900 
aaatgacatt ggcatccaga ttatccactg cagtaaatat tgggaagatg gattcaccca 960 
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ttgaaaagtg gaacctaata 
tgggttttct agcagctgtg 
accttgatca ttccatactt 
ttctgcaggg aataataatg 
ctgataatgt tgctacaccc 
tggcttggat aagtcagggc 
tagttggtgt atttttcttg 
cagccacaag aacagttctc 
gtagcattgg gggccttatt 
ttgtttacac gccagttatt 
ggatttctac ctacctccat 
gttgttacta cccatttaga 
ttctactgct tttagtgatt 
aaagtggtca tacttcttta 
tacaggtatt taccttgctg 
gaaaggaccc ggatagtttc 
cagctctgtt agccttaagt 
ttggagacta ataaattcta 
accacttatg gctctttttc 
cttcagttgg ccctgattca 
aaaaaaaaaa 

<210> 51 
<211> 1722 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 7479974CB1 

<400> 51 

atggatgctg ttaaatactt aaataaattg aatttagata acattgagtt aacaaaatat 60 
ttgtttttta ctggtaaagg tggcgtaggc aaaacaacga tatcaagttt tattgcttta 120 
aacttagcag agaatggaaa gaaagtagct ttagtaagta ctgatccagc tagtaattta 180 
caagatgtat ttcaaatgga attatctaat aaattaacta aatatcaacc tatacctaat 240 
ctctctatag ccaatttcga cccgattgtt gctgcagacg attataaagc acaatctata 300 
gaaccttatg agggtattct accagaagat gtgcttgctg aaatgaaaga acagttaagt 360 
ggttcatgta cagttgaagt agcagcattt aatgaattta caaatttttt atccgataaa 420 
actttagaac aagaatttga tttcattata tttgatacag ctcccacagg tcacactttg 480 
agaatgcttg aattaccttc tgcatggaca gattatttaa atacaacgag taatgacgct 540 
tcttgcttag gtcaattatc aggtttaaat gaaaatagag ttaaatataa ttcagcactt 600 
gaaaaactac gtaaccaaga tgatacgacc atgatgttag ttgcgagacc tactcactct 660 
tctatatatg aaattcaaag agcgcaacaa gaattacaac aactgtcaat ttctaaattc 720 
aaagtaatca ttaacaacta tatagaagaa agtcacggtt taatttcgag tcagatgaaa 780 
tcggaacaag ataaaaacat taatcatttt actgaatggt taaataacaa tcatgcttat 840 
tacgttccat ataaaaagca gaaagaagaa ggtatagaaa atttaactaa tctattaaat 900 
gatgataact taattgaaaa tgatgacttt attgttgaag atcatccgca attcaataaa 960 
ttaatcgatg aaattgaaaa tagtaaagtt caatatttat ttacaatggg aaaaggtggc 1020 
gttggtaaaa cgacagtagc aacgcaatta gctacagcat tatctaataa aggatatcgt 1080 
gttcttttag caactactga ccctactaaa gaaattaatg ttgaaactac aagtaattta 1140 
aatactgctt atattgatga agaacaagca ttagaaaagt ataaaaaaga agtactagcc 1200 
acagtgaatg atgatacacc acaagacgat attgattata ttatggaaga tttaaaatca 1260 
ccttgtacag aagaaatagc atttttcaaa gcctttagtg acattatgga gaatcaagac 1320 
gacatggatt acgtcattgt agacacagct cctacaggcc ataccttgct gttacttgat 1380 
tctagtgaaa atcatcatag agaattaaag aaaaaatcaa ctcaaactac cagtaatgtt 1440 
gaaacattat tacctaaaat tcaaaataaa aatttaacac agatgataat cgtaacacta 1500 
gcagaaaaaa caccttattt agaatctaaa cgtttagtag aagatttaaa tagagctaat 1560 
ataggccata attggtgggt tgttaatcaa tcgttagtta cgctaaatca acgtgatgac 1620 
ctttttagta acaaaaaaga agatgaatca ttttggataa acaagattaa aaatgaaagt 1680 
tttgacaatt attttgtcat accttatggg gggttatcat aa 1722 

<210> 52 



attggcaact tggctttaaa 
gcagcaatta tattgggctg 
ctgtgctcta gcagtgtggc 
gttggggtta tcgttggttc 
attgctgcta gttttggcga 
ttatactcct gtcttgagac 
gctctaaccc ctatttggat 
cactcaggct gggagcctgt 
ctggacacaa ctgtatcaga 
aatggtattg gtggtaattt 
ttacatagca ttccaggaga 
actttctttg gtccaggagt 
cctggacatt taattttcct 
actataatct tcatagtagt 
tggattgctg actggatggt 
tccatcccct acctaacagc 
tttcattttc tttggcttat 
caaactgctc tcaagttacc 
aaaactctta aatcagtagt 
attaaatggc cttaattttt 



gcaggttcag gcaacagtag 1020 
gattccagaa ggaaaatatt 1080 
aactgccttc attgcatctc 1140 
aaagaagact ggtataaatc 1200 
ccttataact cttgccatat 1260 
ctattactac atttctccat 1320 
tataatagct gccaaacatc 1380 
cataacagct atggttataa 1440 
cccaaacttg gttgggattg 1500 
ggtggccatt caggctagca 1560 
attgcctgat gaacccaaag 1620 
aaataataag tctgctcaag 1680 
ctacactatt catttgatga 1740 
gtatttattt ggcgctgtgt 1800 
ccatcacttc tggaggaaag 1860 
attgggtgat ctgctcggga 1920 
tggagatcga gatggagatg 1980 
aaggaagaaa atacacgaca 2040 
ttgacttttg ccagggtaat 2100 
ttttaaggaa tttgtgtcaa 2160 

2170 
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<211> 1424 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7483850CB1 

<400> 52 

ctgatgtcta tttaggaaga aacaaactgg tgtatatttg cttgtttcta cattttctag 60 
gcactgcttt gttatctgtg gtggcttttc ccttggaaga tttctatctg ggcacttacc 120 
atgcagttaa caacatacca aaaacagagc agcacaggct gttttatgta gcactgctga 180 
ccatttgcct tggcattgga ggcgtaagag ccatcgtctg tccactgggt gcttttggcc 240 
ttcaggagta tggatcacaa aaaacgatgt ctttttttaa ctgattcctt caggatatta 300 
tctgcaaact atgaattcca acctgaattt ggatggattt cttctgccga ttgcagtaat 360 
gaatgccatc agcagcctac cattactaat tctggctcct tttctggagt atttcagcac 420 
ctgcctgttt ccctctaaga gagttggatc atttctgtca acatgcatca ttgctggaaa 480 
tctttttgct gcattgtctg tgatgatagc tggcttcttt gaaatacacc gaaaacattt 540 
ccctgcagtg gagcagcccc tttcaggaaa agttctcact gtttcctcca tgccctgttt 600 
ctacctgatt cttcagtatg ttttacttgg agtggcggaa acactggtaa acccagccct 660 
ctctgtaata tcatacagat ttgttccaag caatgtcaga ggaacctcca tgaattttct 720 
gacactgttc aatggatttg gctgtttcac aggggcactg ctggtgaagt tggtatatct 780 
catctcagaa ggtaaaaata ggcaatggtt tccaaacaca ttaaacaaag gcaatttaga 840 
aggcttcttc ttcttcctgg catcattaac attgttgaac gtcctgggat tctgcagtgt 900 
ttcacaaaga tattgtaatc taaatcattt taatgcccag aacatccgtg gaagtaatct 960 
tgaagaaaca cttctcctcc acgaaaaatc tctgaaattt tatggcagta tacaggaatt 1020 
ttcttcaagt attgatcttt gggagacagc cctatgaaac tgtgtttgag tctacctgtc 1080 
ttatgagaac agtattcatt gttttcttct tcagttgagc attttatgtg ttttagtgta 1140 
aaagccaata tatacagagg tgattttaca aatatcatct gtatacttta tttacaaaga 1200 
ctaattatga attttctagt aaactgtcat gcttttgcat taacatgtta tagcattaga 1260 
atttaataaa ataatttcaa agtctcagtg attactctat gacaaaaaaa aaaggggggc 1320 
gccgattagt gagcttcgtc gaccgggaaa taattccgga cggacctgca ggcgtaggcg 1380 
atcctgagat tccatcaatg ggggcgtcga gcatgtttaa aggc 1424 

<210> 53 

<211> 3598 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5508353CB1 

<400> 53 

caatttgcaa cacagtagtg gtttctgctc ctaaccaacc ccgacaaaag atcagacacc 60 

cttcactggg ggggttgccc attaagtctt tggaagagat taaaagtctt ttccagagat 120 

ggtctgtccg aagatcaagt tctccatcgc ttaacagtgg gaaagagcca tcttctggag 180 

ttccaaacgc ctttgtgagc agactccctc tctttagtcg aatgaaacca gcttcacctg 240 

tggaggaaga ggtctcccag gtgtgtgaga gcccccagtg ctccagtagc tcagcttgct 300 

gcacagaaac agagaaacaa cacggtgatg caggcctcct gaatggcaag gcagagtccc 360 

tccctggaca gccattggcc tgcaacctgt gttatgaggc cgagagccca gacgaagcgg 420 

ccttagtgta tgccgccagg gcttaccaat gcactttacg gtctcggaca ccagagcagg 480 

tcatggtgga ctttgctgct ttgggaccat taacatttca actcctacac atcctgccct 540 

ttgactcagt aagaaaaaga atgtctgttg tggtccgaca ccctctttcc aatcaagttg 600 

tggtgtatac gaaaggcgct gattctgtga tcatggagtt actgtcggtg gcttccccag 660 

atggagcaag tctggagaaa caacagatga tagtaaggga gaaaacccag aagcacttgg 720 

atgactatgc caaacaaggc cttcgtactt tatgtatagc aaagaaggtc atgagtgaca 780 

ctgaatatgc agagtggctg aggaatcatt ttttagctga aaccagcatt gacaacaggg 840 

aagaattact acttgaatct gccatgaggt tggagaacaa acttacatta cttggtgcta 900 

ctggcattga agaccgtctg caggagggag tccctgaatc tatagaagct cttcacaaag 960 
cgggcatcaa gatctggatg ctgacagggg acaagcagga gacagctgtc aacatagctt 1020 

atgcatgcaa actactggag ccagatgaca agctttttat cctcaatacc caaagtaaag 1080 

atgcctgtgg gatgctgatg agcacaattt tgaaagaact tcagaagaaa actcaagccc 1140 
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tgccagagca agtgtcatta agtgaagatt 
ggttacgagc tggactcatt atcactggga 
tgcaaaagca gttcctggaa ctgacatctt 
caccgctgca gaaaagtgaa gtggtgaaat 
ttgctattgg tgatggtgcc aatgatgtta 
gggtctcagg tcaagaaggc atgcaggctg 
tcaaacatct cagcaagctc cttcttgtcc 
acatgattct ctattttttc tataagaatg 
agttcttttg tggattttca ggaacatcca 
acctcctctt cacatctgcc cctcctgtca 
cagagaccct catgcaactg cctgaacttt 
taccccatac cttctggatc accttattgg 
ttgtgcctta ttttacctac cagggctcag 
tgaacacagc cgctctgttc atcgttctcc 
cttggattca cttgctggtc atcattggta 
tttttggagc catgtgtgta acttgcaacc 
agcacatgct ggatccagta ttctacttag 
tgcccaggtt tgtatacaga gttcttcagg 
ctaagcactt tgacagacta actccagagg 
gggctggaaa gatgaatcaa gtgacatcaa 
gaagaagacc catgcctggc ccttctgctg 
ctattgagca aggaaactta tctctgtgtg 
ctaaggcctt tgagatggct ggaccctcca 
agttgcaagt attctttcaa ggttggaaga 
gaatgacttg tttttccata agggacatga 
gatgagcatt attgtatgtt tgtatataca 
agagagttta aggaagtgaa atatttaatt 
gattttgtaa aagcattttc attctcttag 
gagatatatt tattttacta ggagatctta 
ctccagtcgg aattttttta agaaaaaagt 
gaaatacgga aaaaacattg tagatggtta 
aacaacaagg tacctaaaat agggtataat 
taatagcttt ttatttccta tggggagatg 
tacctctcac tcatttcaat gttttcctga 
ggattctagg gtttcatcag ggaccaggaa 
aagacctttt atgagatctg cctctgtata 
ggtcaggaga catcagctca gcctgtggcc 
agtagcctga tccaacgtga tatcaatcat 
aggtcattaa gttttacaat tacctggaga 
ttaacacgga actatacagt ggtccggagg 
tcggcgggag tcacacagtc cctcggactt 

<210> 54 

<211> 1485 

<212> DNA 

<213> Homo sapiens 



tacttcagcc tcctgtcccc cgggactcag 1200 
agaccctgga gtttgccctg caagaaagtc 1260 
ggggtcaagc tgtggtctgc tgccgagcca 1320 
tggtccgcag ccatctccag gtgatgaccc 1380 
gcatgataca agtggcagac attgggatag 1440 
tgatggccag tgactttgcc gtttctcagt 1500 
atggacactg gtgttataca cggctttcca 1560 
tggcctatgt gaacctcctt ttctggtacc 1620 
tgactgatta ctgggttttg atcttcttca 1680 
tttatggtgt tttggagaaa gatgtgtctg 1740 
acaaaagtgg tcagaaatca gaggcatact 1800 
atgcttttta tcaaagcctg gtctgcttct 1860 
atactgacat ctttgcattt ggaaaccccc 1920 
tccatctggt cattgaaagc aagagtttga 1980 
gcatcttgtc ttatttttta tttgccatag 2040 
caccatccaa cccttactgg attatgcagg 2100 
tttgtatcct cacgacgtcc attgctcttc 2160 
gatccctgtt tccatctcca attctgagag 2220 
agaggactaa agctctcaag aagtggagag 2280 
agtatgctaa ccaatcagct ggcaagtcag 2340 
tatttgcaat gaagtcagca agttcctgtg 2400 
aaactgcttt agatcaaggc tactctgaaa 2460 
aaggtaaaga aagctagata ccctccttgg 2520 
gggattttga agaggtatct ctccaagcaa 2580 
gcattttact aggcttggaa gagctgacat 2640 
tttgtgatag agggctagag tttgacctag 2700 
cagaaccaaa tgcttttgta aaactttttg 2760 
aaattcaagt attttcaagg ggagtcattt 2820 
tattctaggg aaatgcttta aatggtcagg 2880 
agtttttaat acattggtta ggactcagag 2940 
atttacagat aaaatcccaa gagcctttta 3000 
tatactgctt aaaatacagt tagtgcctat 3060 
cttttggtct tctggctgaa gatgtaggca 3120 
ggtggagcct tcattggaaa ggggaaagag 3180 
tgcattcctc tgtcaggttc caatcaagag 3240 
gatgtcgtca gttaggaaac tgaagccata 3300 
atgggtgatt tcctgtattt taaaactgac 3360 
tttcaacacc atccttcaga ggacaatgtg 3420 
ttttcaggtg gtggggcctc taaaattgga 3480 
tcaaggggtc tcgaaaattt ggtacatctc 3540 
gtgcgggtgg cacacattta caggggca 3598 



<220> 

<221> misc_feature 

<223> Incyte ID No: 8543628CB1 



<400> 54 

gctattggat tctccctata 
gacgctgatt tgcgtgctgt 
gaggctccag catggccagt 
gcaagggcgc tgcgcaaacc 
gcggcgtgct ctacggcgtg 
gggtctacct ggaactgttc 
ttggcgtgcc gatcttcttc 
tgtcgttgtg gggcgccagc 
cccgtggcca acgcgaagca 
acgtactgct gccccaggcc 
gcatcatcaa gaccagctcc 



tgagtcgtat taatttctac 
tgttcttgcc gttgtcgctg 
tcgggtcttg agttgctctg 
ctgtcaattt cgtttttgag 
ctgcgcactt tgaataacaa 
cgcgccattc cggtgctggt 
ggcctgagca tcccgagttt 
gaggtgggcg aagtggtgcg 
ggcttgtcga ttggcctgtc 
ctcaggcgca tgacgccgcc 
ctggcggtgc tgatcggcgt 



tacgaaatgc tcgccgtgct 60 
ttgctcagcc gcctggaaag 120 
ggtgtcgttg ccgcaactgg 180 
catcggcttc agtacggttg 240 
ggcgatcaac ggcgtgttgc 300 
gtggttgtac ctgctgtttt 360 
ctggtgcgcg gtgctggtgc 420 
tggcgcgctg cattcgctgc 480 
cgacctacag ttgtacggct 540 
gaccatcaac gtgtacacgc 600 
ggtggatgtg atcaaggtcg 660 
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gccagcaaat catcgagcgc acctacgagt cggtattgat ctacggtgcg ctgttcctgt 720 
ttttcttctt tatctgctac ccgttgtcgg ccgcctccaa gctgctggaa cggcgctggg 780 
cccaagcatg agcgcattga tcgagttcca gggtttcaac aaattcttcg gcgaggcgca 840 
ggtgctcaag ggcatcgacc tgagcgtgca gcgcggcgaa gtggtggtga tcttaggccc 900 
cagcggctgt ggcaaaagca ccttgttgcg ctgcctcaat gggctggaag tcgcgcacag 960 
cggcagcctg cgctttgccg gcaaagagct gctggataaa accaccgact ggcgccaggt 1020 
acgccaggac attggcatgg tattccagag ctaccacctg tcccgcacat gagcgtgcct 1080 
cgaaacatcc tgctcggccg ctgaaagtgg aaaagcgcga ccgcgtggtt gcctgtgagc 1140 
aggcggaaaa atgctgggag cgccgtggcc ttgccgaaag cgcgacgcct ttcccacgcc 1200 
agctttccgg tggccagcaa caacgcatcg catcgtccgt cgttgtgcat gaaccctcag 1260 
tcatgctgtt gacgaagttc accggggcct cgccgaaatg gcaggaatgt tggagtgtcc 1320 
agaattgccg gaggggtacc ctgtgtttcc cagaatggtt gccggcggcg cgacagggtg 1380 
atgaggggcg tctcaaaaac ccggttttta ccgcacgcgc gactggagct tttcccccaa 1440 
gaggtaatta cgctgcctcg gggagaacaa cggagacaca cgcgc 1485 

<210> 55 
<211> 1470 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7482754CB1 

<400> 55 

gcctgttctg cagaggagga ggctgggctc agggcacttt tcacgccccc tccgttgggc 60 
tggggacctt ggtaagccat ggagggacag accccaggaa gcaggggcct tccagaaaag 120 
cctcaccctg ccacggctgc tgccactctg tcctcgatgg gcgctgtctt catcctcatg 180 
aagtccgcgc tgggagctgg cctgctcaac ttcccctggg ccttctccaa agcgggcgga 240 
gtggtccctg ccttcctggt ggagctggtc tcgttggtct tcctgatcag cgggctggtc 300 
atcctgggct atgctgctgc tgtcagtggc caggccacct accagggtgt ggtcaggggg 360 
ctgtgtggcc ctgccattgg gaagctgtgt gaggcctgct tcctcctcaa cctgctcatg 420 
atctccgtgg ccttcctcag ggtgatcggg gaccagctgg agaagctgtg tgactccctc 480 
ctgtctggca ccccgcccgc cccgcagccg tggtacgcag accagcgctt caccctgccc 540 
ctgctctccg tgctggtcat cctgcccctg tctgccccgc gggagatcgc cttccagaaa 600 
tacacaagcc cctcacacgg tcactgtgtt agcatcctag gcactctggc tgcctgttac 660 
ctggccctgg tcatcaccgt gcagtactac ctctggcccc agggcctcgt gcgtgagtcc 720 
catccttcac tgagccctgc ctcctggacc tctgtgttca gtgtcttccc caccatctgc 780 
ttcgggtttc agtgtcacga agctgccgtc tccatctact gcagcatgcg caaacggagc 840 
ctctcccact gggccctggt gtctgtgctg tccttgctgg cctgctgcct catctattca 900 
ctgacggggg tttatggctt cctgactttt gggacagaag tttctgctga cgtcttgatg 960 
tcctacccag gcaatgatat ggtcatcatt gtggcccggg tcctttttgc tgtctccatc 1020 
gtaactgtct accccatcgt gctcttcctg gggaggtcag tgatgcagga cttctggagg 1080 
aggagctgct tggggggatg ggggcccagc gccctggccg acccctcagg gctgtgggtc 1140 
cggatgccgc tgaccatcct gtgggtcacc gtgacgctcg ccatggcgct gtttatgcct 1200 
gacctcagcg agatcgtcag catcatcgga ggcatcagtt ccttcttcat cttcatcttc 1260 
ccaggccctg gaggcaaaca ccgtttctgc gaaggatggt ttcttaggaa accttcccaa 1320 
acagcccaag cattatctga gaaggggaag ccagcaaggt ttgtgcctca tctgtgcaat 1380 
gggtgtcgag cctataggac caagagtcaa gtgagaaacc tgcgagccac aggaagagag 1440 
tctctgtcta cccagtatga gggcatctga 1470 

<210> 56 
<211> 3132 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 3794818CB1 

<400> 56 

atggagtttg tgcgggcgct gtggctgggc ctggcgctgg cgctggggcc ggggtccgcg 60 
gggggccacc ctcagccgtg cggcgtcctg gcgcgcctcg ggggctccgt gcgcctgggc 120 
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gccctcctgc cccgcgcgcc tctcgcccgc gcccgcgccc gcgccgccct ggcccgggcc 180 
gccctggcgc cgcggctgcc gcacaacctg agcttggagc tggtggtcgc cgcgcccccc 240 
gcccgcgacc ccgcctcgct gacccgcggc ctgtgccagg cgctggtgcc tccgggcgtg 300 
gcggccctgc tcgcctttcc cgaggctcgg cccgagctgc tgcagctgca cttcctggcg 360 
gcggccaccg agacccccgt gctcagcctg ctgcggcggg aggcgcgcgc gcccctcgga 420 
gccccgaacc cattccacct gcagctgcac tgggccagcc ccctggagac gctgctggat 480 
gtgctggtgg cggtgctgca ggcgcacgcc tgggaagacg tcggcctggc cctgtgccgc 540 
actcaggacc ccggcggcct ggtggccctc tggacaagcc gggctggccg gcccccacag 600 
ctggtcctgg acctaagccg gcgggacacg ggagatgcag gactgcgggc acgcctggcc 660 
ccgatggcgg cgccagtggg gggtgaagca ccggtacccg cggcggtcct cctcggctgt 720 
gacatcgccc gtgcccgtcg ggtgctggag gccgtacctc ccggccccca ctggctgttg 780 
gggacaccac tgccgcccaa ggccctgccc accgcggggc tgccaccagg gctgctggcg 840 
ctgggcgagg tggcacgacc cccgctggag gccgccatcc atgacattgt gcaactggtg 900 
gcccgggcgc tgggcagtgc ggcccaggtg cagccgaagc gagccctcct ccccgccccg 960 
gtcaactgcg gggacctgca gccggccggg cccgagtccc cggggcgctt cttggcacgg 1020 
ttcctggcca acacgtcctt ccagggccgc acgggccccg tgtgggtgac aggcagctcc 1080 
caggtacaca tgtctcggca ctttaaggtg tggagccttc gccgggaccc acggggcgcc 1140 
ccggcctggg ccacggtggg cagctggcgg gacggccagc tggacttgga accgggaggt 1200 
gcctctgcac ggcccccgcc cccacagggt gcccaggtct ggcccaagct gcgtgtggta 1260 
acgctgttgg aacacccatt tgtgtttgcc cgtgatccag acgaagacgg gcagtgccca 1320 
gcggggcagc tgtgcctgga ccctggcacc aacgactcgg ccaccctgga cgcactgttc 1380 
gccgcgctgg ccaacggctc agcgccccgt gccctgcgca agtgctgcta cggctactgc 1440 
attgacctgc tggagcggct ggcggaggac acgcccttcg acttcgagct gtacctcgtg 1500 
ggtgacggca agtacggcgc cctgcgggac ggccgctgga ccggcctggt cggggacctg 1560 
ctggccggcc gggcccacat ggcggtcacc agcttcagta tcaactccgc ccgctcacag 1620 
gtggtggact tcaccagccc cttcttctcc accagcctgg gcatcatggt gcgggcacgg 1680 
gacacggcct cacccatcgg tgcctttatg tggcccctgc actggtccac gtggctgggc 1740 
gtctttgcgg ccctgcacct caccgcgctc ttcctcaccg tgtacgagtg gcgtagcccc 1800 
tacggcctca cgccacgtgg ccgcaaccgc agcaccgtct tctcctactc ctcagccctc 1860 
aacctgtgct acgccatcct cttcagacgc accgtgtcca gcaagacgcc caagtgcccc 1920 
acgggccgcc tgctcatgaa cctctgggcc atcttctgcc tgctggtgct gtccagctac 1980 
acggccaacc tggctgccgt catggtcggg gacaagacct tcgaggagct gtcggggatc 2040 
cacgacccca agctgcacca cccggcgcag ggcttccgct tcggcaccgt gtgggagagc 2100 
agcgccgagg cgtacatcaa gaagagcttc cccgacatgc acgcacacat gcggcgccac 2160 
agcgcgccca ccacgccccg cggcgtcgcc atgctcacga gcgacccccc caagctcaac 2220 
gccttcatca tggacaagtc gctcctggac tacgaggtct ccatcgacgc cgactgcaaa 2280 
ctgctgaccg tgggaaagcc cttcgccatt gagggctatg ggatcggact gccccagaac 2340 
tcgccgctca cctccaacct gtccgagttc atcagccgct acaagtcctc cggcttcatc 2400 
gacctgctcc acgacaagtg gtacaagatg gtgccttgcg gcaagcgggt ctttgcggtt 2460 
acagagaccc tgcagatgag catctaccac ttcgcgggcc tcttcgtgtt gctgtgcctg 2520 
ggcctgggca gcgctctgct cagctcgctg ggcgagcacg ccttcttccg cctggcgctg 2580 
ccgcgcatcc gcaaggggag caggctgcag tactggctgc acaccagcca gaaaatccac 2640 
cgcgccctca acacggagcc accagagggg tcgaaggagg agacggcaga ggcggagccc 2700 
agcggccccg aggtggagca gcagcagcag cagcaggacc agccaacggc tccggagggc 2760 
tggaaacggg cgcgccgggc cgtggacaag gagcgccgcg tgcgcttcct gctggagccc 2820 
gccgtggttg tggcacccga agcggacgcg gaggcggagg ctgcgccgcg agagggcccc 2880 
gtctggctgt gctcctacgg ccgcccgccc gccgcaaggc ccacgggggc cccccagccc 2940 
ggggagctgc aggagctgga gcgccgcatc gaagtcgcgc gtgagcggct ccgccaggcc 3000 
ctggtgcggc gcggccagct cctggcacag ctcggggaca gcgcacgtca ccggcctcgg 3060 
cgcttgcttc aggccagagc ggcccccgcg gaggccccac cacactctgg ccgaccgggg 3120 
agccaggaat ga 3132 

<210> 57 
<211> 1832 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 4717525CB1 

<400> 57 

ggagctgacc ctgcggggtc ccgggggggg agggggagcc gcgaagcccc cactgagtgc 60 
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cgccgctgcc ggggggggag ggggagccgc gaagccccca ctgaggccgc cgctgccggg 120 
cctcccctcc cccccgggcg ggcgccatgc gggggagccc gggcgacgcg gagcggcggc 180 
agcgctgggg tcgcctgttc gaggagctgg acagtaacaa ggatggccgc gtggacgtgc 240 
acgagttgcg ccaggggctg gccaggctgg gcgggggcaa cccagacccc ggcgcccaae 300 
agggtatctc ctctgagggt gatgctgacc cagatggcgg gctcgacctg gaggaatttt 3 60 
cccgctatct gcaggagcgg gaacagcgtc tgctgctcat gtttcacagt cttgaccgga 420 
accaggatgg tcacattgat gtctctgaga tccaacagag tttccgagct ctgggcattt 480 
ccatctcgct ggagcaggct gagaaaattt tgcacagcat ggaccgagac ggcacaatga 540 
ccattgactg gcaagaatgg cgcgaccact tcctgttgca ttcgctggaa aatgtggagg 600 
acgtgctgta tttctggaag cattccacgg tcctggacat tggcgagtgc ctgacagtgc 660 
cggacgagtt ctcaaagcaa gagaagctga cgggcatgtg gtggaagcag ctggtggccg 720 
gcgcagtggc aggtgccgtg tcacggacag gcacggcccc tctggaccgc ctcaaggtct 780 
tcatgcaggt ccatgcctca aagaccaacc ggctgaacat ccttgggggg cttcgaagca 840 
tggtccttga gggaggcatc cgctccctgt ggcgcggcaa tggtattaat gtactcaaga 900 
ttgcccccga gtcagctatc aagttcatgg cctatgaaca gatcaagagg gccatcctgg 960 
ggcagcagga gacactgcat gtgcaggagc gcttcgtggc tggctccctg gctggtgcca 1020 
cagcccaaac catcatttac cctatggaga ctctgaagaa ctggtggctt cagcagtaca 1080 
gccacgactc ggcagaccca ggcatcctcg tgctcctggc ctgcggtacc atatccagca 1140 
cctgcggcca gatagccagt tacccgctgg ccctggtccg gacccgcatg caggcacaag 1200 
cctccatcga gggtggcccc cagctgtcca tgctgggtct gctacgtcac atcctgtccc 1260 
aggagggcat gcggggcctc taccggggga tcgcccccaa cttcatgaag gttattccag 1320 
ctgtgagcat ctcctatgtg gtctacgaga acatgaagca ggccttaggg gtcacgtcca 1380 
ggctcgagta cagtggctcg atttcagatc actgcaacct ctgccttcca ggttcaagcg 1440 
attctcctgc ctcagcctcc agagtagctg gtattacagg atttcatcat gttgcccagg 1500 
ctcatctcgg actcgtgggt tcaaggaatt cggcagcctt cagccttcca acgtgctggg 1560 
attacaggaa gccggtcgtc atgccatgag cagccttatg gagtggacca tgtggtaagg 1620 
aactcagcca atagccatgt aactgagctt tgaagaggat cttgctgtcc tggccaacat 1680 
ctcactgcga ttctatcagt tgaatttcct ggatagtcca agctttgttg gatccctgca 1740 
ccggtacagc tgggttccag taactgaatc ctgagtctta tgatgtttta cttcaagcat 1800 
gggtcagggg acagcgggtt atgggttcct tg 1832 

<210> 58 
<211> 1902 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 5091793CB1 

<400> 58 

cccaccttca gtcctggctc tgacaagccc tccagcttca cgccacccgg gatgggagaa 60 

agcaggtgtc gcgagagttg ggcgcaagac gccttgtagg gagtgtaact atggccggcc 120 

tgcggaacga aagtgaacag gagccgctct taggcgacac acctggaagc agagaatggg 180 

acattttaga gactgaagag cattataaga gccgatggag atctattagg attttatatc 240 

ttactatgtt tctcagcagt gtagggtttt ctgtagtgat gatgtccata tggccatatc 300 

tccaaaagat tgatccgaca gctgatacaa gttttttggg ctgggttatt gcttcatata 360 

gtcttggcca aatggtagct tcacctatat ttggtttatg gtctaattat agaccaagaa 420 

aagagcctct tattgtctcc atcttgattt ccgtggcagc caactgcctc tatgcatatc 480 

tccacatccc agcttctcat aataaatact acatgctggt tgctcgtgga ttgttgggaa 540 

ttggagcagg aaatgtagca gttgttagat catatactgc tggtgctact tcccttcagg 600 

aaagaacaag ttccatggca aacataagca tgtgtcaagc attaggtttt attctaggtc 660 

cagtttttca gacttgtttt acattccttg gagaaaaagg tgtgacatgg gatgtgatta 720 

aactgcagat aaacatgtat acaacaccag ttttacttag cgccttcctg ggaattttaa 780 

atattattct gatccttgcc atactaagag aacatcgtgt ggatgactca ggaagacagt 840 

gtaaaagtat taattttgaa gaagcaagta cagatgaagc tcaggttccc caaggaaata 900 

ttgaccaggt tgctgttgtg gccatcaatg ttctgttttt tgtgactcta tttatctttg 960 

ccctttttga aaccatcatt actccattaa caatggatat gtatgcctgg actcaagaac 1020 

aagctgtgtt atataatggc ataatacttg ctgctcttgg ggttgaagcc gttgttattt 1080 

tcttaggagt taagttgctt tccaaaaaga ttggcgagcg tgctattcta ctgggaggac 1140 
tcatcgttgt atgggttggc ttctttatct tgttaccttg gggaaatcaa tttcccaaaa 1200 
tacagtggga agatttgcac aataattcaa tccctaatac cacatttggg gaaattatta 1260 
ttggtctttg gaagtctcca atggaagatg acaatgaaag accaactggt tgctcgattg 1320 
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aacaagcctg gtgcctctac 
tgctaatagg attaggctat 
ttctaggacc aaaacctcag 
cccggattct tgggcctatg 
cattcagcct ggtgtgtgga 
aaagactcat tgctctttct 
gtgatggaaa ctacttgctg 
gttgggtgca gtgtctcatg 
tgcttgaggc caggaattca 
aaaataaaaa caaaaattta 

<210> 59 
<211> 2820 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 5945527CB1 

<400> 59 

gcgcccagct ctgtagcctc ctccgtcgac tcagccttag gtaccggtca ggcaaaatgc 60 
ggtcctccct ggctccggga gtctggttct tccgggcctt ctccagggac agctggttcc 120 
gaggcctcat cctgctgctg accttcctaa tttacgcctg ctatcacatg tccaggaagc 180 
ctatcagtat cgtcaagagc cgtctgcacc agaactgctc ggagcagatc aaacccatca 240 
atgatactca cagtctcaat gacaccatgt ggtgcagctg ggccccattt gacaaggaca 300 
actataagga gttactaggg ggcgtggaca acgccttcct catcgcctat gccatcggca 360 
tgttcatcag tggggttttt ggggagcggc ttccgctccg ttactacctc tcagctggaa 420 
tgctgctcag tggccttttc acctcgctct ttggcctggg atatttctgg aacatccacg 480 
agctctggta ctttgtggtc atccaggtct gtaatggact cgtccagacc acaggctggc 540 
cctctgtggt gacctgtgtt ggcaactggt tcgggaaggg gaagcggggg ttcatcatgg 600 
gcatctggaa ttcccacaca tctgtgggca acatcctggg ctccctgatc gccggcatct 660 
gggtgaacgg gcagtggggc ctgtcgttca tcgtgcctgg catcattact gccgtcatgg 720 
gcgtcatcac cttcctcttc ctcatcgaac acccagaaga tgtggactgc gcccctcctc 780 
agcaccacgg tgagccagct gagaaccagg acaaccctga ggaccctggg aacagtccct 840 
gctctatcag ggagagcggc cttgagactg tggccaaatg ctccaagggg ccatgcgaag 900 
agcctgctgc catcagcttc tttggggcgc tccggatccc aggcgtggtc gagttctctc 960 
tgtgtctgct gtttgccaag ctggtcagtt acaccttcct ctactggctg cccctctaca 1020 
tcgccaatgt ggctcacttt agtgccaagg aggctgggga cctgtctaca ctcttcgatg 1080 
ttggtggcat cataggcggc atcgtggcag ggctcgtctc tgactacacc aatggcaggg 1140 
ccaccacttg ctgtgtcatg ctcatcttgg ctgcccccat gatgttcctg tacaactaca 1200 
ttggccagga cgggattgcc agctccatag tgatgctgat catctgtggg ggcctggtca 1260 
atggcccata cgcgctcatc accactgctg tctctgctga tctggggact cacaagagcc 1320 
tgaagggcaa cgccaaagcc ctgtccacgg tcacggccat cattgacggc accggctcca 1380 
taggtgcggc tctggggcct ctgctggctg ggctcatctc ccccacgggc tggaacaatg 1440 
tcttctacat gctcatctct gccgacgtcc tagcctgctt gctcctttgc cggttagtat 1500 
acaaagagat cttggcctgg aaggtgtccc tgagcagagg cagcgggtat aaagaaatat 1560 
gaggccccaa ttggaacagc agcatggagg gtcccagttg ggtccccaac gtgctcccca 1620 
tgggcaagac aatggaaact tccacaagca ggggaggcaa accctcttta ttgaacatta 1680 
gccagcccag cccagacccc agggctgcct aaggacacag agattctcca tgggaagggg 1740 
actgccaagc atgaggaaat agaagattca ggggcctgag ctctggaagc tgcaagcaaa 1800 
agggatggga ctagggctga gttgtgtctc cattttgata aggaaaggat atgctcacac 1860 
tcttgcttgt tcagattcca agacagaagg cttcacaagg ccaacgcctg gaaaatgggc 1920 
atctctccct cccatgttaa gctttaacct ctgtaatctg cctgtatcta taggtgggca 1980 
tctcactcca tcaaaggagc ccagcctctc tttgtccctc taccatgcaa cagtcttctc 2040 
tgtgcatttc cccaagctgg gccctcttct actctccatt taggcctgtt gataactcca 2100 
ttacccaccc atcactgctg ttcctccagg gccagcactc gggcgaggca ggggagctgc 2160 
cttcggtaca taatttgaag gggcactccc tcttgggcac atgccggccc tgagtgcctc 2220 
ccttgcctca ctctgatccc ggccccataa tgtcctcagt ggaaggtgac tgggggccgg 2280 
tgctgtgggg agagtagaaa gaggggttgg catgactaaa aataccagta tgtgtattaa 2340 
gtattttgag aatgaaatgc caaggagtgc ctactgtatg ccagctctgc tctaggaatg 2400 
gagtagacag tggacacaag aaggacttac gccctgagca caggtgccaa cggtgacaag 2460 
actggcaaga cgtgagggca tgaatggttc attcaggcag ctgctgcagg tgtggtcacc 2520 



accccggtga ttcatctggc ccagttcctt acatcagctg 1380 
ccagtctgca atcttatgtc ctatactcta tattcaaaaa 1440 
ggtgtataca tgggctggtt aacagcatct ggaagtggag 1500 
ttcatcagcc aagtgtatgc tcactgggga ccacgatggg 1560 
ataatagtgc tcaccatcac cctcctggga gtggtttaca 1620 
gtaagatatg ggaggattca ggaataaact agctaagact 1680 
tgtggcactt cctggtctaa agctctgcta gacaattgcg 1740 
cctgtaatcc cagcactttg gaaggccaag gcaggaggat 1800 
agaccagcct gggaaacaaa gtgagaccct gtctttacaa 1860 
aaactgtaaa aaaaaaaaaa aa 1902 
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tggtgccatc tgctgctccc ttttccactt ttctatgtcc tccttccacc ccaagtcccg 2580 

gatcactcgc tgttttctgg ctagctcttg gcatctccat ctgagcctaa agttgcccac 2640 

tggcaccaat agattctgtt tgacctgctg tgcccatgct catctttgtc ggggcttggc 2700 

ttggcagcga gtgaggctgg catggggtgc tgaagtgggg tctcagtgag gggtgacagc 2760 

gtgctggcag tcctcacagc cctcgctcgc tctcggcgcc tcctctgcct gggctcccac 2820 



<210> 60 
<211> 3920 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 6941124CB1 



<400> 60 

ccggccgtga gtgcgcgagt gtgccatggc cgcggccacc aggcgcgtgt tccatctgca 60 
gccgtgcggt gggtgcctga acccgggggc gcggcgggaa ggccgcccgg gtctccgcca 120 
ccgtctcccg tcggggaccc cggggagaac gcggatgccg tctgcctgcg cgggccgggg 180 
ccggtgggcg agagggctgc gggattccca gccgcgggga ggagggcgcg ggtgcgggag 240 
gcccgggcgc ggcttcgcgg ccaccgcgcg cttggtttca ttagggctct gctgtcgccg 300 
tcttgaattt cttaccaggt ttgaacagct gatgcagcgg gtcgagggcg gggagctcga 360 
aagtgggcag ggcggagctg tggggctccc agcgccccga gcgccgagct tttgggagcc 420 
tggcgggccg ccgcctcgga aggtctcatt ttaccttccg cccgggcatc ttcccgtccg 480 
gcgccgcggt gggagccccg gaacccctgg agttttggga gtcccgggac ccctgaccgt 540 
gccctgcgcg gccgcagcgg tttctccctt gctctgacac gggagggcca gccctgggtc 600 
cccgccagga cccaggtttg tacgaccctt tacctgcacg tgggccgccg ggacggaagc 660 
ggagtgtgca ccggcccaac cctcccagac cctcgtcgcc cacggcctgg gacgcgcccc 720 
agcgccccct agcagatggg ctaagcaatt gaatttctga gattaaggct ggcttcccct 780 
gctatggggc cccctcccga gcactgtccc ctgcttccct ttctccctag ggtggcgtgg 840 
gttgcatccc tcggaagagc gaaggaatga gccaggtcgg ggggcgggga gacaggtgca 900 
cacaggaggt ccagggcttg gtccatgggg ctggtgacct ttctgcttcc cttgcagaaa 960 
actctcccac catgtcgcag aatggatact tcgaggattc aagctactac aagtgtgaca 1020 
cagatgacac cttcgaagcc cgagaggaga tcctggggga tgaggccttc gacactgcca 1080 
actcctccat cgtgtctggc gagagtatcc gtttcttgtc aatgtcaacc ttgagatgca 1140 
ggcccacaac actgagaatg aagcgacttc cggtggctgt gtgctcctgc acacctcccg 1200 
aaagtacctg aagttaaaga acttcaagga agagatccgt gcgcaccgcg acctagatgg 1260 
cttcctggcg caggccagca tcgtcctgaa cgagacggcc acctccctgg ataacgtgct 1320 
gcggaccatg cttcgccgct tcgcccggga ccctgacaac aatgagccca actgcaacct 1380 
ggacctgctc atggccatgc tcttcaccga tgccggggca cccatgcggg gtaaagtcca 1440 
cctgctgtca gataccatcc aaggggtcac cgccacagtg acaggggtgc ggtaccagca 1500 
gtcgtggctc tgcatcatct gtaccatgaa ggccctacag aagcggcacg tgtgcatcag 1560 
ccgcctggtt cgcccacaga actgggggga gaattcctgt gaggttcggt tcgtcatcct 1620 
ggtgctggcc ccacccaaga tgaaaagcac taagactgcg atggaggtgg cgcgcacgtt 1680 
tgccaccatg ttctcggata tcgccttccg ccagaagctc ctggagaccc gcacagagga 1740 
ggaattcaag gaggccttgg tgcatcagag acagctgctc accatggtga gccacggtcc 1800 
agtggcgccg agaacgaagg aacgcagcac agtctccctc cctgcccaca gacacccaga 1860 
gcccccaaag tgcaaggact ttgtcccttt tgggaagggc atccgggagg acatcgcacg 1920 
caggttcccc ttgtacccct tggacttcac tgatggcatt attgggaaaa acaaggctgt 1980 
gggcaaatac atcaccacca ccctgttcct ctacttcgcc tgcctcctgc ccaccatcgc 2040 
tttcgggtct ctcaatgacg agaacacaga cggggccatc gacgtgcaga agaccatagc 2100 
cgggcagagc atcgggggcc tgctctacgc gctcttctct gggcagccat tggtgattct 2160 
gctgaccacc gcgcccctgg cgctctacat ccaggtgatt cgtgtcatct gtgatgacta 2220 
tgacctggac ttcaactcct tctacgcatg gacgggcctg tggaatagtt tcttccttgc 2280 
gctttatgcc tttttcaacc tcagcctggt catgagtctc ttcaagaggt cgacggagga 2340 
gatcatcgcc ctcttcattt ccatcacgtt tgtgctggat gccgtcaagg gcacggttaa 2400 
aatcttctgg aagtactact atgggcatta cttggacgac tatcacacaa aaaggacttc 2460 
atcccttgtc agcctgtcag gcctcggcgc cagcctcaac gccagcctcc acactgccct 2520 
caatgccagc ttcctcgcca gccccacgga gctgccctcg gccacacact caggccaggc 2580 
gaccgccgtg ctcagcctcc tcatcatgct gggcacgctc tggctgggct acaccctcta 2640 
ccaattcaag aagagcccct acctgcaccc ctgcgtgcga gagatcctgt ccgactgcgc 2700 
cctgcccatc gcggtgctcg ccttctccct catcagctcc catggcttcc gggaaatcga 2760 
gatgagcaag ttccgctaca accccagcga gagccccttt gcgatggcgc agatccagtc 2820 
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gctgtccctg agggccgtca gcggtgccat gggcctcggc ttcctgctgt ccatgctctt 2880 
cttcatcgag cagaacttgg tggccgcctt ggtgaatgca ccggagaaca ggctggtgaa 2940 
gggcactgcc taccactggg acctcctgct cctcgccatc atcaacacag ggctgtctct 3000 
gtttgggctg ccttggatcc atgccgccta cccccactcc ccgctgcacg tgcgagccct 3060 
ggccttagtg gaggagcgtg tggagaacgg acacatctat gacacgattg tgaacgtgaa 3120 
ggagacgcgg ctgacctcgc tgggcgccag cgtcctggtg ggcctgtccc tgttgctgct 3180 
gccggtcccg cttcagtgga tccccaagcc cgtgctctat ggcctcttcc tctacatcgc 3240 
gctcacctcc ctcgatggca accagctcgt ccagcgcgtg gccctgctgc tcaaggagca 3300 
gactgcgtac cccccgacac actacatccg gagggtgccc cagaggaaga tccactactt 3360 
cacgggcctg caggtgcttc agctgctgct gctgtgtgcc ttcggcatga gctccctgcc 3420 
ctacatgaag atgatctttc ccctcatcat gatcgccatg atccccatcc gctatatcct 3480 
gctgccccga atcattgaag ccaagtactt ggatgtcatg gacgctgagc acaggccttg 3540 
actggcagac cctgcccacg ccccattcgc cagccctcca cgtcctccca ggctggctct 3600 
ggagctgtga ggggaggtgt aggtgtgtgg gtgactgctc tgtgctgcgc cttctcatgg 3660 
ctgactcagg cctggggcat ctgggcattg taggggtgca gtggtatgtg cccacccctc 3720 
tcccattatc ctttagcttt aggccaagag cgttgctcag ggcagcttct gcccagggtg 3780 
ggtgggactg agcaggatgg attttctttt gataaaagag tcgatgcctg aaagagaaac 3840 
catttccttg attgtgtaag gaacttgctg gacgcacatt agagaataaa gctcctgttt 3900 
ctaggctcct aaaaaaaaaa 3920 

<210> 61 
<211> 1333 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> incyte ID No: 6972530CB1 

<400> 61 

tggggcaacc tgctaacagc cgctgttgtg gctttgctgt atggcggact ggtcgtgggc 60 
ttccttccgc ggcagggcgt ttccttcgaa gggcatctgt ttggtgcgct cgccggattt 120 
attgcagcca aggtacttct atccacgccg cgcagcgggt ttaatgccgg gtgaacgcac 180 
ttgagcatcc agacgccgag gcgatgctgt cgtcaggatt caagtaccca gggagccagg 240 
ggctaaatgt cgattttcct actgctacgc atgtacgcct cgtccctctt tcatcgcttc 300 
ggttgggcag gcctgatcgt tgccttgggt gtccacctga gcaccgctta cctgggcctg 360 
gtgttgctgg gcgaacaaca cctcaccgct gccgccacgt ttatctactt ctatctcacc 420 
accacactca ccgtcggcta tggcgacctc gcgccgcaga cgtctgctgg gcgaattttc 480 
gtggcggcct gggtcatgct cgggggcatt gcgctgctca cggcagccat cggcaaaacc 540 
accagcagtg ttatcgatgc atggagaaaa ggcatgaagg gcaaaggtga tttcaccggc 600 
aaggtcggcc ataccgtgtt gatcggctgg gagggcgcgt ccagcgagcg ggtgattgag 660 
ttgttgctgc aagacgaaac ctcgaacgac aacctgatcg tgatctgcga ttgcagcctt 720 
gaagaaaatc cgatgccggg caaagcagcg ttcatccgcg gcgaaagcct gtcttccacc 780 
gcactgttgc tgcgtgccgg tgtgcccggt gccgagcgtg tgctggtgcg aaccccgtct 840 
gatgacctga cgctggccac cgtgctggcg gtgaaccaac tgagtcccgt cgggcatgtg 900 
gtcgcccact tcaacgaaag tgaaatcgcc gcactcgcca gctcctacgc cccgcgcctg 960 
gagtgcacgt ccagcatggc aattgaaatg ctggtgcgcg cctctcagga tccgggctcg 1020 
tcggtggtca tcaatgaatt gctgtgcgtg gggcagggcg ccacccagta cctgatgaaa 1080 
ttgcccgaag cctttgaggc aacattcggc gagctgtaca cgcagatgaa ggagcgccac 1140 
aacgccaccc tcattggcta tcgcgccaaa ggtgttcagc agccgtcgat caacccgccc 1200 
agtgccacag aggtcaaagg cggcgaactc ttctacatcg cttccactcg ccttaaggaa 1260 
atctcccatg ggatggctta aacagttgat ggggcttgag gccccaaaag ccactaccga 1320 
ctcgagccga acc 1333 

<210> 62 
<211> 8487 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 6991750CB1 
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<400> 62 

atgcacctat taataattta cattcaattt cagattgcag aggagggatc aaccatttct 60 
tgtgtggttg agagaaccag aggggctctg gattatgtgc atgtttttta caccatttca 120 
cagattgaaa ctgatggcat taattacctt gttgatgact ttgctaatgc cagtggaact 180 
attacattcc ttccttggca gagatcagag gttctgaata tatatgttct tgatgatgat 240 
attcctgaac ttaatgagta tttccgtgtg acattggttt ctgcaattcc tggagatggg 300 
aagctaggct caactcctac cagtggtgca agcatagatc ctgaaaagga aacgactgat 360 
atcaccatca aagctagtga tcatccatat ggcttgctgc agttctccac agggctgcct 420 
cctcagccta aggacgcaat gaccctgcct gcaagcagcg ttccacatat cactgtggag 480 
gaggaagatg gagaaatcag gttattggtc atccgtgcac agggacttct gggaagggtg 540 
actgcggaat ttagaacagt gtccttgaca gcattcagtc ctgaggatta ccagaatgtt 600 
gctggcacat tagaatttca accaggagaa agatataaat acattttcat aaacatcact 660 
gataattcta ttcctgaact ggaaaaatct tttaaagttg agttgttaaa cttggaagga 720 
ggagccagtc taggagtggc ttcccaaatt ctagtgacaa ttgcagcctc tgaccacgct 780 
catggcgtat ttgaatttag ccctgagtca ctctttgtca gtggaactga accagaagat 840 
gggtatagca ctgttacatt aaatgttata agacatcatg gaactctgtc tccagtgact 900 
ttgcattgga acatagactc tgatcctgat ggtgatctcg ccttcacctc tggcaacatc 960 
acatttgaga ttgggcagac gagcgccaat atcactgtgg agatattgcc tgacgaagac 1020 
ccagaactgg ataaggcatt ctctgtgtca gtcctcagtg tttccagtgg ttctttggga 1080 
gctcatatta atgccacgtt aacagttttg gctagtgatg atccatatgg gatattcatt 1140 
ttttctgaga aaaacagacc tgttaaagtt gaggaagcaa cccagaacat cacactatca 1200 
ataataaggt tgaaaggcct catgggaaaa gtccttgtct catatgcaac actagatgat 1260 
atggaaaaac caccttattt tccacctaat ttagcgagag caactcaagg aagagactat 1320 
ataccagctt ctggatttgc tctttttgga gctaatcaga gtgaggcaac aatagctatt 1380 
tcaattttgg atgatgatga gccagaaagg tccgaatctg tctttatcga actactcaac 1440 
tctactttag tagcgaaagt acagagtcgt tcaattccaa attctccacg tcttgggcct 1500 
aaggtagaaa ctattgcgca actaattatc attgccaatg atgatgcatt tggaactctt 1560 
cagctctcag caccaattgt ccgagtggca gaaaatcatg ttggacccat tatcaatgtg 1620 
actagaacag gaggagcatt tgcagatgtc tctgtgaagt ttaaagctgt gccaataact 1680 
gcaatagctg gtgaagatta tagtatagct tcatcagatg tggtcttgct agaaggggaa 1740 
accagtaaag ccgtgccaat atatgtcatt aatgatatct atcctgaact ggaagaatct 1800 
tttcttgtgc aactgatgaa tgaaacaaca ggaggagcca gactaggggc tttaacagag 1860 
gcagtcatta ttattgaggc ctctgatgac ccctatggat tatttggttt tcagattact 1920 
aaacttattg tagaggaacc tgagtttaac tcagtgaagg taaacctgcc aataattcga 1980 
aattctggga cactcggcaa tgttactgtt cagtgggttg ccaccattaa tggacagctt 2040 
gctactggcg acctgcgagt tgtctcaggt aatgtgacct ttgcccctgg ggaaaccatt 2100 
caaaccttgt tgttagaggt cctggctgac gacgttccgg agattgaaga ggttatccaa 2160 
gtgcaactaa ctgatgcctc tggtggaggt actattgggt tagatcgaat tgcaaatatt 2220 
attattcctg ccaatgatga tccttatggt acagtagcct ttgctcagat ggtttatcgt 2280 
gttcaagagc ctctggaaag aagttcctgt gctaatataa ctgtcaggcg aagcggaggg 2340 
cactttggtc ggctgttgtt gttctacagt acttccgaca ttgatgtagt ggctctggca 2400 
atggaggaag gtcaagattt actgtcctac tatgaatctc caattcaagg ggtgcctgac 2460 
ccactttgga gaacttggat gaatgtctct gccgtggggg agcccctgta tacctgtgcc 2520 
actttgtgcc ttaaggaaca agcttgctca gcgttttcat ttttcagtgc ttctgagggt 2580 
ccccagtgtt tctggatgac atcatggatc agcccagctg tcaacaattc agacttctgg 2640 
acctacagga aaaacatgac cagggtagca tctcttttta gtggtcaggc tgtggctggg 2700 
agtgactatg agcctgtgac aaggcaatgg gccataatgc aggaaggtga tgaattcgca 2760 
aatctcacag tgtctattct tcctgatgat ttcccagaga tggatgagag ttttctaatt 2820 
tctctccttg aagttcacct catgaacatt tcagccagtt tgaaaaatca gccaaccata 2880 
ggacagccaa atatttctac agttgtcata gcactaaatg gtgatgcctt tggagtgttt 2940 
gtgatctaca atattagtcc caatacttcc gaagatggct tatttgttga agttcaggag 3000 
cagccccaaa ccttggtgga gctgatgata cacaggacag ggggcagctt aggtcaagtg 3060 
gcagtcgaat ggcgtgttgt tggtggaaca gctactgaag gtttagattt tataggtgct 3120 
ggagagattc tgacctttgc tgaaggtgaa accaaaaaga cagtcatttt aaccatcttg 3180 
gatgactctg aaccagagga tgacgaaagt atcatagtta gtttggtgta cactgaaggt 3240 
ggaagtagaa ttttgccaag ctccgacact gttagagtga acattttggc caatgacaat 3300 
gtggcaggaa ttgttagctt tcagacagct tccagatctg tcataggtca tgagggagaa 3360 
attttacaat tccatgtgat aagaactttc cctggtcgag gaaatgttac tgttaactgg 3420 
aaaattattg ggcaaaatca agaactcaat tttgctaact ttagcggaca acttttcttt 3480 
cctgaggggt cgttgaatac aacattgttt gtgcatttgt tggatgacaa cattcctgag 3540 
gagaaagaag tataccaagt cattctgtat gatgtcagga cacaaggagt tccaccagcc 3600 
ggaatcgccc tgcttgatgc tcaaggatat gcagctgtcc tcacagtaga agccagtgat 3660 
gaaccacatg gagttttaaa ttttgctctt tcatcaagat ttgtgttact acaagaggct 3720 
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aacataacaa ttcagctttt catcaacaga 
acatatacca cggttcctgg aatgctgagt 
gagccagaag ttgattttgt ccctatcatt 
gcagcagcca tcaacattac cattcttgag 
ctggtgaatt taacttacgt tggacttacc 
ctagattcag aaggtttgac tgcacaagtt 
gtaattgaat ggcaacaaag caggtttgaa 
gtagcccaga ggagcagaga acctcttggc 
ttggaagcac aagtggggct ggattatatc 
ggagaaaggt ataaaaatgt caatatcatg 
gaaaaatttc agctgatttt aacaaatcct 
atagccttaa ttattgtcct tgctaatgat 
agtgagcact ttttcctaag agagccaaca 
ttgtacattg ttcgggaacc tgcacaagga 
gtgacagaag tgaattcctc aaatgaatct 
gttttagaag aaggtgttcg attcaaggcc 
ccagaaatgg atgagtattt tgtttgcacc 
ggggtgcatg ttcaaaccct gataacagtt 
agtatctctg cagttgaaaa tagagccacc 
gtgtatttaa atgtatctcg aactaatggc 
ccgaaaactt gtgaggcctt taatattggt 
gaaagaaatg aagaaaagcc ttctcttaac 
ttattcctgg tacaaacaat cattattctg 
gacagccaag attatttaat cattgcaagt 
ttcaggtgga atggaggaag cttcgtgttg 
accgtggcct tgttcaacaa gggaggctct 
aggctaaact cccttttatt cagatggtct 
cctgtcagtg ggacaacaga agttgaggct 
tttgccaaaa atgtctttct aggagatcag 
ggacagtctt ccttcaggta ttttcagtct 
tccttcacac cagcctcagg aatagcccac 
ctttactgct ggaattcgga gcgtaatcaa 
tatgatgtgg cttctgttac agtaaagtcc 
gtgggagctc attcacatat atatgagcta 
cctagttcag gtgaactgat atttgaacct 
atccttgatg atacagttcc agaaaaagaa 
aaaggaggag cagagattgg cattaatgat 
gatgcctatg gaattgttgc atttgctcag 
gagcaagata gcctagtaac cttgaacgtt 
accatagcat gggaagctga tggaagtatt 
gaaaaacgaa tgtcagcaaa gattttattt 
ctaactattc ttgctgataa tataccagag 
cgtatcacca cagaaggggt tgaggactca 
agcaagtctg ttataacaac tttgcccaat 
gctgcgtctg tcttcattag agtagcagag 
caaatagctc gagataaagg actacttggg 
aatttcttac tgcatgtcga taatcaagct 
acaataataa taatgaaaga aaacataaaa 
gatgaccttc ctgaattgga ggaaggattt 
aactctgact tctctacagg acagccaagt 
ataatgatag aagaaaatga cgatcccaga 
gctggggaag ttattactgc ctatgaggtg 
gtagtccggc tggctggaag ctttggggca 
agtgctggcc tggaagactt taaaccatct 
gttactgcaa tgatagaaat caccataatt 
ttcaatattt ccttgatcag tgttgctgga 
actgttgtta ttccacaaaa tgattctcca 
gtaatgattg atgaatccct ttcatccgat 
gtccggtccc caggaggaaa aggaaccgtc 
aaacataacc ttagtccttt gaatgggacc 
accattgtgt tgcacacact tcaagacaca 
cagctgatat caattgatga ggtagaaata 
attcggggtg ataagcgagc atcaggagaa 



gaatttggat ctctaggagc tatcaatgtc 3780 
ctgaagaacc aaacagtagg aaacctagca 3840 
ggctttctga ttttagaaga aggggaaaca 3900 
gatgatgtac cagagctaga agaatatttc 3960 
atggctgctt caacttcatt tcctcccaga 4020 
attattgatg ccaatgatgg ggcccgaggt 4080 
gtaaatgaaa cccatggaag tttaacattg 4140 
catgtttcct tatttgtgta tgctcagaat 4200 
ttcaccccaa tgattcttca ttttgctgat 4260 
attcttgatg atgacattcc agaaggagat 4320 
tctcctggac tagagctagg gaaaaataca 4380 
gacggccctg gagttctatc atttaacaac 4440 
gctctctacg tccaggagag tgttgcagta 4500 
ttgtttggaa cagtgacagt tcagttcatt 4560 
aaagatctga ctccttccaa aggctatatt 4620 
ctacaaatat ctgccatatt agacacggaa 4680 
ttgtttaatc caactggagg tgctagacta 4740 
ttgcaaaacc aggccccttt ggggctattc 4800 
tccatagaca tcgaagaagc caataggacc 4860 
attgatttgg ctgatttaaa tatagaaaat 4920 
ttttctccct actttgtgat tactcatgaa 4980 
agtgtgttta cattcacatc tggatttaaa 5040 
gaaagttctc aagtaagata ttttacttca 5100 
caaagagatg attccgaatt aactcaggtc 5160 
catcaaaaac tccctgtccg aggtgtgctg 5220 
gtgttcttag ccatttccca ggctaatgcc 5280 
ggcagtgggt ttattaactt tcaagaggtg 5340 
ttgtcttcag ccaatgatat ttacctaata 5400 
aattcaattg atattttcat ctgggagatg 5460 
gtagattttg ctgctgttaa cagaatccac 5520 
atacttctta ttggccaaga tatgtctgct 5580 
ttctcttttg ttctggaagt accttctgct 5640 
cttaattcaa gcaagaattt aatagctcta 5700 
gcctacattt ccagccattc tgactttatt 5760 
ggtgagagag aagctacaat agcagtaaat 5820 
gaatccttca aagttcaact taaaaatccc 5880 
tctgtaacaa taaccattct gtctaatgat 5940 
aattcattat ataagcaagt ggaagaaatg 6000 
gaacgcttaa aaggaacata tggccgtata 6060 
agtgatatat ttcctacctc aggagtggtg 6120 
actgaaggcc aggtactgtc aacaatcact 6180 
ttatcagagg ttgtgattgt aaccctcacc 6240 
tacaaaggtg ctactattga tcaggacaga 6300 
gactcacctt ttggcttggt gggctggcgt 6360 
cctaaagaaa acaccaccac tcttcagtta 6420 
gatattgcca ttcacttgag agctcaaccc 6480 
actgagaatg aagattatgt attgcaagaa 6540 
gaagctcatg ccgaagtttc cattttgccg 6600 
attgtcacta tcactgaggt gaacctggtg 6660 
gtgcggaggc ccggaatgga aatagctgag 6720 
ggaattttta tgtttcatgt tactagaggc 6780 
cctccaccct tgaacgttct tcaagttcct 6840 
gtaaatgttt attggaaagc atcaccagac 6900 
catgggattc ttgaatttgc agataaacag 6960 
gatgatgctg aatttgaatt gacagagacg 7020 
ggtggcagac ttggtgatga tgttgtggta 7080 
tttggagtat ttggatttga agaaaagact 7140 
gaccctgatt catatgtgac attgacggtt 7200 
cgacttgagt ggaccataga tgagaaggct 7260 
cttcattttg atgagactga gtcccagaag 7320 
gtgttggagg aggacaggcg tttcaccatt 7380 
tctccagtaa aaggtagtgc atcaataatt 7440 
gttgggatag ctccgtcatc taggcacatc 7500 
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ctcattgggg aaccctcagc 
ccagggattt tgggggaggt 
tttgctgaaa catcaggaaa 
atacaggctt tgaacgatga 
gcagtcagtg agggaggagt 
gccagcgact ctccctatgg 
gcacagaggg ttaacatcac 
tggtacaaga cgatgagcgg 
gagctcctct ttgaagcagg 
gactatcctg aaggcccaga 
aggtaaagga gaaaggcaat 
ttctattgtg tagagattaa 
atttgtgtga cattagaaca 
ttttaatttt gttaaatata 
ggacacactc tgggtagcca 
cagcctaaca gctctctaaa 
ttccatactt gtgaaagggt 

<210> 63 
<211> 3264 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<223> Incyte ID No: 71726948CB1 

<400> 63 

cgctggaagc agctggaggc tccagttctc atctgctcag gtgtccccgg cgccttggcg 60 
aactcggcca ctccagttcc tcacgtggtg agcactcagg gcagcgggtc gattttccga 120 
ggtcccatac ctgggtttga ggggcgcggc tcgcagcggc gggtgcaggg gcgactgcca 180 
gccctcaccc cgcctcgggg tgcgttcgga ggccgacacc tggaggacgc ctccagtccc 240 
cgcgggacgc cacgcctgcg cgccagggat ccgggataag aagtgcgcgc cgggctccgg 300 
ctgcgcgccg cggggccacc agtttgcgcg cagggctcag gcgaccgtgc ggccatggac 360 
acgccacggg gcatcggcac cttcgtggtg tgggactacg tggtgttcgc gggcatgctg 420 
gtcatctcgg ccgccatcgg catctactac gccttcgctg ggggcggcca gcagacctcc 480 
aaggacttcc tgatgggcgg ccgcagaatg accgcagtgc ccgtggcgct gtccctcacc 540 
gctagcttca tgtcagccgt cactgtcctg ggcaccccct ccgaggtcta ccgttttggg 600 
gccattttta gcatctttgc cttcacctac ttctttgtgg tggtcatcag cgcggaggtc 660 
ttcctcccgg tgttctacaa actgggaatt accagcacct acgagtattt agaacttcga 720 
tttaacaaat gtgttcgtct ctgtggaaca gtcctcttca ttgttcaaac aattctgtat 780 
actggaattg ttatttatgc ccctgccctg gctttgaatc aagtcacagg atttgatctg 840 
tggggcgcgg tagtggcaac gggggtggtc tgcacattct actgcacact gggtggtctt 900 
aaagcagtta tctggacaga tgtttttcaa gttgggatca tggtggctgg atttgcatcc 960 
gtgattatac aggctgtggt gatgcaaggt ggaatcagca ctattttaaa tgatgcctat 1020 
gatggtggaa gattaaattt ctggaatttt aatcctaacc ctttgcaaag acacaccttc 1080 
tggacaatta ttataggagg gaccttcaca tggaccagca tctacggtgt caaccaatcc 1140 
caggtgcaga gatatatttc ttgtaaaagc agattccagg caaaactgtc tctctacatc 1200 
aatcttgtgg gactctgggc aatcctcaca tgctcagtgt tttgtgggct cgccctatat 1260 
tccaggtacc atgactgtga tccttggaca gccaagaaag tgtctgcacc agaccagctc 1320 
atgccttatt tggtactgga cattctgcaa gattatccag gacttcctgg actttttgtg 1380 
gcctgtgctt acagtgggac attaagcaca gtgtcctcca gtattaatgc cttagcagca 1440 
gtaactgtgg aagatctaat caaaccttac ttcagatcgc tctcagaaag gtctctgtct 1500 
tggatttccc aaggaatgag tgtggtgtat ggagccctgt gtattggaat ggctgcgctg 1560 
gcgtcactta tgggagcttt gttgcaggca gcactcagcg tatttggtat ggttggtgga 1620 
ccacttatgg gcctgttcgc tttgggcatt ttggttccct ttgccaactc aattggagca 1680 
cttgttggtc tgatggctgg atttgccatt tctctatggg ttggaattgg agctcaaata 1740 
tatcctccac ttcctgagag aacattgcca ttgcaccttg atatccaagg ctgtaacagc 1800 
acctacaatg agacaaattt gatgacaacc acagaaatgc catttactac tagtgttttt 1860 
caaatataca atgttcaaag gactccactg atggataact ggtattcttt atcatatctg 1920 
tacttcagca ctgttggaac tttggtaaca ttattagtgg ggatacttgt cagtttatca 1980 
acaggaggaa gaaaacagaa cttagacccc agatatatac taaccaaaga ggacttttta 2040 
tccaattttg atatttttaa gaaaaagaag catgttttga gctataaatc acatccagtg 2100 



aaaatataat ggtaccgcta 
cacagtgttc tggaggatat 
actgacaatg cgagacgaac 
cattcccgag gaaaaaagct 
tctgagtgaa tccagcagca 
ccgatttgcc ttttcacatg 
aatcatccgt tccagtggag 
gacagcggaa gcaggcttgg 
ggagatgagg aaaagtctgc 
ggaattttct ctaacaatta 
taggaaaaag aaagcaaaga 
tactgagctc taaagcagtt 
ggacatacaa gggaaattat 
acttccctct gaaaattgaa 
ggaagttgaa tatgataacg 
tcagaggtca gctggtgtgg 
gaaggga 



ttatcagcct tgttcgaggc 7560 
tccctccttc cgtgggggaa 7620 
agtctgcagt cattgtagta 7680 
tctatgagtt tcagctcact 7740 
ctgccaacat cacggtggtg 7800 
agcaacttcg agtgtcagaa 7860 
attttggcca tgtgcgactc 7920 
attttgttcc tgcagcaggg 7980 
atgttgaaat ccttgatgat 8040 
caaaggtgga actccaggga 8100 
gcagagagaa agagagaaag 8160 
ctttccctac tctttctcag 8220 
gacaatgact gagttttttt 8280 
agtacgtgtt tctattgtgt 8340 
gcttctaatt ggtgaagaca 8400 
ctgaatttgc ttcttttgat 8460 

8487 
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gaagatggtg gaactgataa tcctgctttc aaccacattg aattgaactc agatcagagt 2160 
ggcaagagca atgggactcg tttgtgaagc tgctctgata ctagatatcc ttaaatgatg 2220 
tttcaatttt atatgttttc taagataatt ggatcaggtt ttctttgtgt gtgtgtgtgt 2280 
gttgtatcat gagtgtttgg gggataagtt tttgttaaaa caaagtctgg actatcttca 2340 
tttactacat cattaattga tgttactctg gagtttagaa ttctggcatt gacatttccc 2400 
tctctttcct ttatttcgat gaagctataa ttgtgaaaat tgtaactaca tagatgctga 2460 
aaggctaata cacacatatg cacatgtatt tgattgtcaa aggtatattc ttaaatttgg 2520 
gtattattga aaatattttc catgccttgg tgctagcata taagtttgga agtttgccaa 2580 
catcacaatt catcttgaaa agagcttttt tccctcctac cacatacacc attcttaggg 2640 
agcaatgagg taacaggtct gtgttgtcta gatctttgct ttttatcccc ctatcagtcc 2700 
agggcatata ctaacctgca aactgattct gaatcaggaa ggtggtaatc aataagtatt 2760 
ctggctggga aagaccgtgg gcccaatgat caaagtcttc ttggtgctgt tcattaattc 2820 
ttgtgccttt tggcttgttt tctagagttt ctgggctttg gctgctgata ctgcctttct 2880 
tagactgtaa tttttatctg catgcccagt ttctgaccta tcaacttggg ttttattgtg 2940 
cactctaact gagcttgtct tcataatttt ctgtttattg ccctgggctt ggatatgtct 3000 
caagacactc atgtgaatca tgccacccca aatcctggct tatcaagtcc cagactataa 3060 
attatgaact cccattagct tggtactaac atatacttga tgtaggtatt tatggacttg 3120 
atgatccaag aatattatat tcttcaaaat ggttaagctc catggagtta gatgactaca 3180 
cttaatgcta ttaagttgaa cttttgaatg tcaactaatt tgcaatcaat taaagataca 3240 
tatgcctaga aaaaaaaaaa aaaa 3264 

<210> 64 

<211> 1659 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7487393CB1 

<400> 64 

atgggctttg atgtgctcct ggatcaagtg ggtggcatgg ggagattcca gatttgtctg 60 
atagctttct tttgcatcac caacatccta ctgttcccta atattgtgtt ggagaacttc 120 
actgcattca cccctagtca tcgctgctgg gtccccctcc tggacaatga cactgtgtct 180 
gacaatgata ccgggaccct cagcaaggat gacctcctga gaatctccat cccactggac 240 
tcaaacctga ggccacagaa gtgtcagcgc tttatccatc cccagtggca gctccttcac 300 
ctgaacggga ccttccccaa cacaaatgag ccagacacgg agccctgtgt ggatggctgg 360 
gtgtacgaca gaagctcttt cctctccacc atcgtgactg agtgggacct ggtatgtgaa 420 
tctcagtcac taaaatcaat ggttcaatcc ctatttatgg ctgggtcact gctgggaggt 480 
ctaatatatg gccatctttc agacagggtt ggacggaaga tcatatgcaa attgtgtttc 540 
ctccagctgg ccatctctaa cacctgtgcg gccttcgctc ccaccttcct tgtttactgc 600 
atactgcgct tcttggcagg gttctccacc atgactattt tgggaaacac ttttattctc 660 
agcttagagt ggacattgcc ccggtcacga tctatgacaa taatggtgct attatgttcc 720 
tacagtgttg ggcagatgct cctaggaggg ctggcttttg ccattcagga ctggcacata 780 
ttgcaactga ctgtgtctac acccataatt gtcctcttct tgtcctcttg gaagatggtg 840 
gagtctgctc ggtggctgat tatcaacaat cagctagatg agggcttaaa ggagcttaga 900 
agagttgcac acataaatgg aaaaaagaat actgaagaga cactgaccac tgagcttgtg 960 
agatccacca tgaagaagga gttggatgca gtccgaatta aaacatccat tttttccctg 1020 
ttccgtgcac ccaaattgcg aatgagagtc ttcggcctgt gctttgtgag attcgcaatc 1080 
actgtaccct tttatggcct gatactcaac ttgcagcact tagggagcaa tgtctccctg 1140 
ttccagattc tctgtggagc tgtcacattc acagccagat gtgtttccct tttgacactg 1200 
aatcatatgg gtcgtcgaat aagccagata ttgttcacgt tcccggtggg acttttcatt 1260 
ctggtcaaca cctttttgcc ccaagaaatg cagatcctgc gtgtggtttt agcaactttg 1320 
ggaattggta gtgtttctgc tgctagcaac agtgcttctg tccaccacaa cgagctcgtc 1380 
cccaccatat tgaggtcaac agttgcagga atcaatgcag tgtccggtag gactggggca 1440 
gcactggctc ctctgttgat gaccttaatg gcgtattctc cccacctacc ctggatttcc 1500 
tatggagtct tccccatcct tgctgtccct gttatcctcc tccttccaga aaccagggat 1560 
ctacctcttc ctaacaccat ccaggatgtg gaaaatgaca gaaaagattc aagaaacata 1620 
aagcaggaag atacttgcat gaaagtaaca cagttttaa 1659 
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